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if) (54) Title: PURIFIED HEPATITIS C VIRUS ENVELOPE PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 
IT) 

(57) Abstract: The present invention relates to a method for purifying recombinant HCV single or specific oligomeric envelope 
£J proteins selected from the group consisting of El and/or E2 and/or EI/E2, characterized in that upon lysing the transformed host 
cells to isolate the recombinantly expressed protein a disulphide bond cleavage or reduction step is carried out with a disulphide bond 
cleavage agent. The present invention also relates to a composition isolated by such a method. The present invention also relates to 
the diagnostic ad therapeutic application of these compositions. Furthermore, the invention relates to the use of HCV El protein and 
^* peptides for prognosing and monitoring the clinical effectiveness and/or clinical outcome if HCV treatment. 
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PURIFIED HEPATITIS C VIRUS ENVELOPE PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 
Field of the invention 

5 

The present invention relates to the general fields of recombinant protein expression, purification of 
recombinant proteins, synthetic peptides, diagnosis of HCV infection, prophylactic treatment against HCV 
infection and to the prognosis/monitoring of the clinical efficiency of treatment of an individual with chronic 
hepatitis, or the prognosis/monitoring of natural disease. 

10 More particularly, the present invention relates to purification methods for hepatitis C virus envelope 

proteins, the use in diagnosis, prophylaxis or therapy of HCV envelope proteins purified according to the 
methods described in the present invention, the use of single or specific oligomeric £1 and/or E2 and/or E1/E2 
envelope proteins in assays for monitoring disease, and/or diagnosis of disease, and/or treatment of disease. 
The invention also relates to epitopes of the E1 and/or E2 envelope proteins and monoclonal antibodies thereto, 

15 as well their use in diagnosis, prophylaxis or treatment. 

Background of the invention 

The E2 protein purified from cell lysates according to the methods described in the present invention 

2 0 reacts with approximately 95% of patient sera. This reactivity is similar to the reactivity obtained with E2 secreted 

from CHO ceils (Spaete et al., 1992). However, the intracellulariy expressed form of E2 may more closely 
resemble the native viral envelope protein because it contains high mannose carbohydrate motifs, whereas the 
E2 protein secreted from CHO cells is further modified with galactose and sialic acid sugar moieties. When the 
aminoterminal half of E2 is expressed in the baculovirus system, only about 13 to 21% of sera from several 
25 patient groups can be detected (Inoue et al., 1992). After expression of E2 from E. coli, the reactivity of HCV 
sera was even lower and ranged from 14 (Yokosuka et al., 1992) to 17% (Mite et al., 1992). About 75% of HCV 
sera (and 95% of chronic patients) are anti-E1 positive using the purified, vaccinia-expressed recombinant E1 
protein of the present invention, in sharp contrast with the results of Kohara et al. (1992) and Hsu et al. (1993). 
Kohara et al. used a vaccinia-virus expressed E1 protein and detected anti-E1 antibodies in 7 to 23% of 

3 0 patients, while Hsu et al. only detected 1 4/50 (28%) sera using baculovirus-expressed E1 . 

These results show that not only a good expression system but also a good purification protocol are 
required to reach a high reactivity of the envelope proteins with human patient sera. This can be obtained using 
the proper expression system and/or purification protocols of the present invention which guarantee the 
conseivation of the natural folding of the protein and the purification protocols of the present invention which 
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guarantee the elimination of contaminating proteins and which preserve the conformation, and thus the reactivity 
of the HCV envelope proteins. The amounts of punned HCV envelope protein needed for diagnostic screening 
assays are in the range of grams per year. For vaccine purposes, even higher amounts of envelope protein 
would be needed. Therefore, the vaccinia virus system may be used for selecting the best expression constructs 
5 and for limited upscaling, and large-scale expression and purification of single or specific oligomeric envelope 
proteins containing high-mannose carbohydrates may be achieved when expressed from several yeast strains. 
In the case of hepatitis B for example, manufacturing of HBsAg from mammalian ceils was much more costly 
compared with yeast-derived hepatitis B vaccines. 

10 Aims of the invention 

It is an aim of the present invention to provide a new purification method for recombinanily expressed 
El and/or E2 and/or E1/E2 proteins such that said recombinant proteins are directly usable for diagnostic and 
vaccine purposes as single or specific oligomeric recombinant proteins free from contaminants instead of 
15 aggregates. 

It is another aim of the present invention to provide compositions comprising purified (single or specific 
oligomeric) recombinant Et and/or E2 and/or E1/E2 glycoproteins comprising conformational epitopes from the 
E1 and/or E2 domains of HCV. 

It is yet another aim of the present invention to provide novel recombinant vector constructs for 

2 0 recombinant^ expressing E1 and/or E2 and/or E1/E2 proteins, as well as host cells transformed with said vector 

constructs. 

It is also an aim of the present invention to provide a method for producing and purifying recombinant 
HCV E1 and/or E2 and/or E1/E2 proteins. 

It is also an aim of the present invention to provide diagnostic and immunogenic uses of the 
25 recombinant HCV E1 and/or E2 and/or E1/E2 proteins of the present invention, as well as to provide kits for 
diagnostic use, vaccines or therapeutics comprising any of the recombinant HCV E1 and/or E2 and/or E1/E2 
proteins of the present invention. 

It is further an aim of the present invention to provide for a new use of El, E2, and/or E1/E2 proteins, or 
suitable parts thereof, for monitoring/prognosing the response to treatment of patients (e.g. with interferon) 

3 0 suffering from HCV infection. 

It is also an aim of the present invention to provide for the use of the recombinant E1 , E2, and/or E1/E2 
proteins of the present invention in HCV screening and confirmatory antibody tests. 

It is also an aim of the present invention to provide E1 and/or E2 peptides which can be used for 
diagnosis of HCV infection and for raising antibodies. Such peptides may also be used to isolate human 
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Rule 45, Federal Rules of Civil Procedure, Parts C&D: 

(c) PROTECTION OF PERSONS SUBJECT TO SUBPOENAS. 

(1) A party or an attorney responsible for the issuance and service of a subpoena shall 
take reasonable steps to avoid imposing undue burden or expense on a person subject to 
that subpoena. The court on behalf of which the subpoena was issued shall enforce this 
duty and impose upon the party or attorney in breach of mis duty an appropriate sanction, 
which may include, but is not limited to, lost earnings and reasonable attorney's fee. 

(2XA) A person commanded to produce and permit inspection and copying of 
designated books, papers, documents or tangible things, or inspection of premises need not 
appear in person at the place of production or inspection unless commanded to appear for 
deposition, hearing or trial. 

(b) Subject to paragraph (dX2) of this rule, a person commanded to produce and permit 
inspection and copying may, within 14 days after service of the subpoena or before the 
time specified for compliance if such time is less than 1 4 days after service, serve upon the 
party or attorney designated in the subpoena written objection to inspection or copying of 
any or all of the designated materials or of the premises. If objection is made, the party 
serving the subpoena shall not be entitled to inspect and copy the materials or inspect the 
premises except pursuant' to an order of the court by which the subpoena was issued. If 
objection has been made, the party serving the subpoena may, upon notice to the person 
commended to produce, move at any time for an order to compel the production. Such an 
order to compel production shall protect any person who is not a party or an officer of a 
party from significant expense resulting from the inspection and copying commanded. 

(3XA) On timely motion, the court by which a subpoena was issued shall quash or 
modify the subpoena if it 

(i) fails to allow reasonable time for compliance; 

(ii) requires a person who is not a party or an officer of a party to travel to a place 
more than 100 miles from the place where the person resides, is employed 



or regularly transacts business in person, except that, subject to the provisions of clause 
(cX3XBX u 0 of this rule, such a person may in order to attend trial be commanded to travel 
from any such place within the state in which the trial is held, or 

(Hi) requires disclosure of privileged or other protected matter and no exception or 
waiver applies, or 

(fv) subjects a person to undue burden. 
(B) If a subpoena 

(i) requires disclosure of a trade secret or other confidential research, development, or 
commercial information, or 

(ii) requires disclosure of an unretained expert's opinion or information not describing 
specific events or occurrences in dispute and resulting from the expert's study made not at 
the request of any party, or 

(Hi) requires a person who is not a party or an officer of a party to incur substantial 
expense to travel more than 100 miles to attend trial, the court may, to protect a person 
subject to or affected by the subpoena, quash or modify the subpoena or, if the party in 
whose behalf the subpoena is issued shows a substantial need for the testimony or material 
that cannot be otherwise met without the undue hardship and assures that the person to 
whom the subpoena is addressed will be reasonably compensated, the court may order 
appearance or production only upon specified conditions. 

(d) DUTIES IN RESPONDING TO SUBPOENA. 

(1) A person responding to a subpoena to produce documents shall produce them as 
they are kept in the usual course of business or shall organize and label them to correspond 
with the categories in the demand. 

(2) When information subject to a subpoena is withheld on a claim mat is privileged or 
subject to protection as trial preparation materials the claim shall be made expressly and 
shall be supported by a description of the nature of the documents, communications, or 
things not produced that is sufficient to enable the demanding party to contest the claim 
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monoclonal antibodies. 

It is also an aim of the present inveniion to provide monoclonal antibodies, more particularly human 
monoclonal antibodies or mouse monoclonal antibodies which are humanized, which react specifically with E1 
and/or E2 epitopes, either comprised in peptides or conformational epitopes comprised in recombinant proteins. 
5 It is also an aim of the present invention to provide possible uses of anii-E1 or anti-E2 monoclonal 

antibodies for HCV antigen detection or for therapy of chronic HCV infection. 

It is also an aim of the present invention to provide kits for monitoring/prognosing the response to 
treatment (e.g. with interferon) of patients suffering from HCV infection or monitoring/prognosing the outcome of 
the disease. 

10 All the aims of the present invention are considered to have been met by the embodiments as set out 

below. 

Definitions 

15 The following definitions serve to illustrate the different terms and expressions used in the present 

invention. 

The term 'hepatitis C virus single envelope protein' refers to a polypeptide or an analogue thereof (e.g. 
mimotopes) comprising an amino acid sequence (and/or amino acid analogues) defining at least one HCV 
epitope of either the E1 or the E2 region. These single envelope proteins in the broad sense of the word may be 
20 both monomeric or homo-oligomeric forms of recombinantiy expressed envelope proteins. Typically, the 
sequences defining the epitope correspond to the amino acid sequence of either the El or the E2 region of HCV 
(either identically or via substitution of analogues of the native amino acid residue that do not destroy the 
epitope). In general, the epitope-defining sequence will be 3 or more amino acids in length, more typically, 5 or 
more amino acids in length, more typically 8 or more amino acids in length, and even more typically 10 or more 

2 5 amino acids in length. With respect to conformational epitopes, the length of the epitope-defining sequence can 

be subject to wide variations, since it is believed that these epitopes are formed by the three-dimensional shape 
of the antigen (e.g. folding). Thus, the amino acids defining the epitope can be relatively few in number, but 
widely dispersed along the length of the molecule being brought into the correct epitope conformation via folding. 
The portions of the antigen between the residues defining the epitope may not be critical to the conformational 

3 0 structure of the epitope. For example, deletion or substitution of these intervening sequences rfiay not affect the 

conformational epitope provided sequences critical to epitope conformation are maintained (e.g. cysteines 
involved in disulfide bonding, glycosylate sites, etc.). A conformational epitope may also be formed by 2 or 
more essential regions of subunits of a homooligomer or heterooligomer. 

The HCV antigens of the present invention comprise conformational epitopes from the E1 and/or E2 
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(envelope) domains of HCV. The E1 domain, which is believed to correspond to the viral envelope protein, is 
currently estimated lo span amino acids 192-383 of the HCV polyprotein (Hijikata et a!., 1991). Upon expression 
in a mammalian system (glycosylated), it is believed to have an approximate molecular weight of 35 kDa as 
determined via SOS-PAGE The E2 protein, previously called NS1, is believed to span amino acids 384-309 or 
5 384-746 (Grakoui et a!., 1993) of the HCV polyprotein and to also be an envelope protein. Upon expression in a 
vaccinia system (glycosylated), it is believed to have an apparent gel molecular weight of about 72 kDa. It is 
understood that these protein endpoints are approximations (e.g. the carboxy terminal end of E2 could lie 
somewhere in the 730-820 amino acid region, e.g. ending at amino acid 730, 735, 740, 742, 744, 745, 
preferably 746, 747, 748, 750, 760, 770, 780, 790, 800, 809, 810, 820). The E2 protein may also be expressed 

10 together with the E1 , P7 (aa 747-809),- NS2 (aa 810-1026), NS4A (aa 1658-171 1) or NS4B (aa 1712-1972). 
Expression together with these other HCV proteins may be important for obtaining the correct protein folding. 

It is also understood that the isolates used in the examples section of the present invention were not 
intended to limit the scope of the invention and that any HCV isolate from type 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10 or any 
other new genotype of HCV is a suitable source of El and/or E2 sequence for the practice of the present 

15 invention. 

The E1 and E2 antigens used in the present invention may be full-length viral proteins, substantially 
full-length versions thereof, or functional fragments thereof (e.g. fragments which are not missing sequence 
essential to the formation or retention of an epitope). Furthermore, the HCV antigens of the present invention 
can also include other sequences that do not block or prevent the formation of the conformational epitope of 

2 O interest. The presence or absence of a conformational epitope can be readily determined though screening the 
antigen of interest with an antibody (polyclonal serum or monoclonal to the conformational epitope) and 
comparing its reactivity to that of a denatured version of the antigen which retains only linear epitopes (if any). In 
such screening using polyclonal antibodies, it maybe advantageous to adsorb the polyclonal serum first with the 
denatured antigen and see if it retains antibodies to the antigen of interest. 

2 5 The HCV antigens of the present invention can be made by any recombinant method that provides the 

epitope of intrest. For example, recombinant intracellular expression in mammalian or insect cells is a preferred 
method to provide glycosylated E1 and/or E2 antigens in 'native' conformation as is the case for the natural HCV 
antigens. Yeast cells and mutant yeast strains (e.g. mnn 9 mutant (Kniskem et ai., 1994) or glycosylation 
mutants derived by means of vanadate resistance selection (Ballou et al., 1991)) may be ideally suited for 

30 production of secreted high-mannose-type sugars; whereas proteins secreted from mammalian cells may 
contain modifications including galactose or sialic acids which may be undesirable for certain diagnostic or 
vaccine applications. However, it may also be possible and sufficient for certain applications, as it is known for 
proteins, to express the antigen in other recombinant hosts (such as E. coli) and renature the protein after 
recovery. 
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The term 'fusion polypeptide' intends a polypeptide in which the HCV antigen(s) are part of a single 
continuous chain of amino acids, which chain does not occur in nature. The HCV antigens may be connected 
directly to each other by peptide bonds or be separated by intervening amino acid sequences. The fusion 
polypeptides may also contain amino acid sequences exogenous to HCV. 
5 The term 'solid phase 1 intends a solid body to which the individual HCV antigens or the fusion 

polypeptide comprised of HCV antigens are bound covalently or by noncovalent means such as hydrophobic 
adsorption. 

The term "biological sample' intends a fluid or tissue of a mammalian individual (e.g. an anthropoid, a 
human) that commonly contains antibodies produced by the individual, more particularly antibodies against 

10 HCV. The fluid or tissue may also contain HCV antigen. Such components are known in the art and include, 
without limitation, blood, plasma, serum, urine, spinal fluid, lymph fluid, secretions of the respiratory, intestinal or 
genitourinary tracts, tears, saliva, milk, white blood cells and myelomas. Body components include biological 
liquids. The term 'biological liquid* refers to a fluid obtained from an organism. Some biological fluids are used as 
a source of other products, such as clotting factors (e.g. Factor VIII;C), serum albumin, growth hormone and the 

15 like. In such cases, it is important that the source of biological fluid be free of contamination by virus such as 
HCV. 

Tne term 'immunologically reactive' means that the aniigen in question will react specifically with anti- 
HCV antibodies present in a body component from an HCV infected individual. 

The term 'immune complex' intends the combination formed when an antibody binds to an epitope on 
20 an antigen. 

'EV as used herein refers to a protein or polypeptide expressed within the first 400 amino acids of an 
HCV polyprotein, sometimes referred to as the E, ENV or S protein. In its natural form it is a 35 kDa glycoprotein 
which is found in strong association with membranes. In most natural HCV strains, the E1 protein is encoded in 
the viral polyprotein following the C (core) protein. The E1 protein extends from approximately amino acid (aa), 
25 1 92 to about aa 383 of the full-length polyprotein. 

The term 'E1' as used herein also includes analogs and truncated forms that are immunologically 
cross-reactive with natural El, and includes E1 proteins of genotypes 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or any other 
newly identified HCV type or subtype. 

'E2 1 as used herein refers to a protein or polypeptide expressed within the first 900 amino acids of an 
3 0 HCV polyprotein, sometimes referred to as the NS1 protein. In its natural form it is a 72 kDa glycoprotein that is 
found in strong association with membranes. In most natural HCV strains, the E2 protein is encoded in the viral 
polyprotein following the E1 protein. The E2 protein extends from approximately amino acid position 384 to 
amino acid position 746, another form of E2 extends to amino acid position 809. The term 'E2* as used herein 
also includes analogs and truncated forms that are immunologically cross-reactive with natural E2. For example, 
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insertions of multiple codons between codon 383 and 384, as well as deletions of amino acids 38^387 have 
been reported by Kato et al. (1 992). 

'E1/E2* as used herein refers to an oligomeric form of envelope proteins containing at least one E1 
component and at least one E2 component 
5 The term 'specific oligomeric 1 E1 and/or £2 and/or E1/E2 envelope proteins refers to all possible 

oligomeric forms of recombinant^ expressed El and/or E2 envelope proteins which are not aggregates. El 
and/or E2 specific oligomeric envelope proteins ace also referred to as homo-oiigomeric E1 or E2 envelope 
proteins (see below). 

The term 'single or specific oligomeric' E1 and/or E2 and/or E1/E2 envelope proteins refers to single 

1 0 monomeric E1 or E2 proteins (single in the strict sense of the word) as well as specific oligomeric E1 and/or E2 
and/or E1/E2 recombinant^ expressed proteins. These single or specific oligomeric envelope proteins according 
to the present invention can be further defined by the following formula (E1)*(E2)y wherein x can be a number 
between 0 and 100, and y can be a number between o and 100, provided that x and y are not both 0. With x=l 
and y=0 said envelope proteins include monomeric El . 

15 The term 'homo-oligomer' as used herein refers to a complex of El and/or E2 containing more than 

one E1 or E2 monomer, e.g. E1/E1 dimers. E1/E1/E1 trimers or E1/E1/E1/E1 tetramers and E2'E2 dimers, 
E2/E2/E2 trimers or E2/E2/E2/E2 tetramers. El pentamers ana hexamers, E2 pentamers and hexamers or any 
higher-order homo-oligomers of E1 or E2 are all 'homo-oligomers' within the scope of this definition. The 
oligomers may contain one, two, or several different monomers of E1 or E2 obtained from different types or 

20 subtypes of hepatitis C virus including for example those described in an international application published 
under WO 94/25601 and European application No. 94870166.9 both by the present applicants. Such mixed 
oligomers are still homo-oligomers within the scope of this invention, and may allow more universal diagnosis, 
prophylaxis or treatment of HCV. 

The term 'purified' as applied to proteins herein refers to a composition wherein the desired protein 

25 comprises at least 35% of the total protein component in the composition. The desired protein preferably 
comprises at least 40%, more preferably at least about 50%, more preferably at least about 60%, still more 
preferably at least about 70%, even more preferably at least about 80%, even more preferably at least about 
90%, and most preferably at least about 95% of the total protein component. The composition may contain other 
compounds such as carbohydrates, salts, lipids, solvents, and the like, withouth affecting the determination of 

3.0 the percentage purity as used herein. An 'isolated 1 HCV protein intends an HCV protein composition that is at 
least 35% pure. 

The term 'essentially purified proteins' refers to proteins purified such that they can be used for in vitro 
diagnostic methods and as a therapeutic compound. These proteins are substantially free from cellular proteins, 
vector-derived proteins or other HCV viral components. Usually these proteins are purified to homogeneity (at 
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least 80% pure, preferably, 90%, more preferably 95%, more preferably 97%, more preferably 98%, more 
preferably 99%, even more preferably 99.5%, and most preferably the contaminating proteins should be 
undetectable by conventional methods like SDS-PAGE and silver staining. 

The term 'recombinantiy expressed' used within the context of the present invention refers to the fact 
5 that the proteins of the present invention are produced by recombinant expression methods be it in prokaryotes, 
or lower or higher eukaryotes as discussed in detail below. 

The term lower eukaryote' refers to host cells such as yeast, fungi and the like. Lower eukaryotes are 
generally (but not necessarily) unicellular. Preferred lower eukaryotes are yeasts, particularly species within 
Saccharomvces. Schizosaccharomvces, Kluveromyces . Pichia (e.g. Pichia pastoris) , Hansenula (e.g. 
10 Hansenula polymorpha) . Yarowia. Schwaniomvces , Schizosaccharomvces . Zyqosaccharomvces and the like. 
Saccharomvces cerevisiae , S. carlsberoensis and K. lactis are the most commonly used yeast hosts, and are 
convenient fungal hosts. 

The term 'prokaryoies' refers to hosts such as Ecoli , Lactobacillus . Lactococcus . Salmonella . 
Streptococcus . Bacillus subtilis or Streotomvces . Also these hosts are contemplated within the present invention. 
15 The term "higher eukaryote' refers to host cells derived from higher animals, such as mammals, 

reptiles, insects, and the like. Presently preferred higher eukaryote host cells are derived from Chinese hamster 
(e.g. CHO), monkey (e.g. COS and Vera cells), baby hamster kidney (BHK), pig kidney (PK15), rabbit kidney 13 
cells (RK13), the human osteosarcoma cell line 143 B, the human cell line HeLa and human hepatoma cell lines 
like Hep G2, and insect cell lines (e.g. Spodoptera fruoiperda) . The host cells may be provided in suspension or 

2 0 flask cultures, tissue cultures, organ cultures and the like. Alternatively the host cells may also be transgenic 

animals. 

The term 'polypeptide 1 refers to a polymer of amino adds and does not refer to a specific length of the 
product; thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide. This term 
also does not refer to or exclude post-expression modifications of the polypeptide, for example, glycosylates, 
25 acetylations, phosphorylations and the like. Included within the definition are, for example, polypeptides 
containing one or more analogues of an amino acid (including, for example, unnatural amino acids. PNA, etc.), 
polypeptides with substituted linkages, as well as other modifications known in the art, both naturally occurring 
and non-naturally occurring. 

The term 'recombinant polynucleotide or nucleic acid 1 intends a polynucleotide or nucleic acid of 

3 0 genomic, cDNA, semisynthetic, or synthetic- origin which, by virtue of its origin or manipulation : (1) is not 

associated with all or a portion of a polynucleotide with which it is associated in nature, (2) is linked to a 
polynucleotide other than that to which it is linked in nature, or (3) does not occur in nature. 

The term 'recombinant host cells', 'host cells', 'cells', 'cell lines', 'cell cultures', and other such terms 
denoting microorganisms or higher eukaryotic cell lines cultured as unicellular entities refer to cells which can be 
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or have been, used as recipients for a recombinant vector or other transfer polynucleotide, and include the 
progeny of the original cell which has been transfected. It is understood that the progeny of a singie parental cell 
may not necessarily be completely identical in morphology or in genomic or total DNA complement as the 
original parent, due to natural, accidental, or deliberate mutation. 
5 The term Yeplicon' is any genetic element, e.g., a plasmid, a chromosome, a virus, a cosmid, etc., that 

behaves as an autonomous unit of polynucleotide replication within a ceil; i.e., capable of replication under its 
own control. 

The term Vector' is a replicon further comprising sequences providing replication and/or expression of a 
desired open reading frame. 

10 tenn 'control sequence 1 refers to polynucleotide sequences which are necessary to effect the 

expression of coding sequences to which they are ligated. The nature of such control sequences differs 
depending upon the host organism; in prokaryotes, such control sequences generally include promoter, 
ribosomal binding site, and terminators; in eukaryotes, generally, such control sequences include promoters, 
terminators and, in some instances, enhancers. The term 'control sequences' is intended to include, at a 

15 minimum, ail components whose presence is necessary for expression, and may also include additional 
components whose presence is advantageous, for example, leader sequences which govern secretion. 

The term 'promoter' is a nucleotide sequence which is comprised of consensus sequences which allow 
the binding of RNA polymerase to the DMA template in a manner such that mRNA production initiates at the 
normal transcription initiation site for the adjacent structural gene. 

20 The expression 'operably linked' refers to a juxtaposition wherein the components so described are in a 

relationship permitting them to function in their intended manner. A control sequence 'operably linked' to a 
coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions 
compatible with the control sequences. 

An 'open reading frame' (ORF) is a region of a polynucleotide sequence which encodes a polypeptide 
25 and does not contain stop codons; this region may represent a portion of a coding sequence or a total coding 
sequence. 

A 'coding sequence' is a polynucleotide sequence which is transcribed into mRNA and/or translated 
into a polypeptide when placed under the control of appropriate regulatory sequences. The boundaries of the 
coding sequence are determined by a translation start codon at the S'-terminus and a translation stop codon at 
30 the 3*-terminus. A coding sequence can include but is not limited to mRNA, DNA (including cDNA), and 
recombinant polynucleotide sequences. 

As used herein, 'epitope' or 'antigenic determinant 1 means an amino add sequence that is 
immunoreactive. Generally an epitope consists of at least 3 to 4 amino acids, and more usually, consists of at 
least 5 or 6 amino acids, sometimes the epitope consists of about 7 to 8, or even about 10 amino acids. As used 
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herein, an epitope of a designated polypeptide denotes epitopes with the same amino acid sequence as the 
epitope in the designated polypeptide, and immunologic equivalents thereof. Such equivalents also include 
strain, subtype (=genotype), or type(group)-specific variants, e.g. of the currently known sequences or strains 
belonging to genotypes 1a, 1b, 1c, 1d, 1e, 1f, 2a, 2b, 2c, 2d, 2e, 2f, 2g, 2h, 2i, 3a, 3b, 3c, 3d, 3e, 3f, 3g, 4a, 4b, 
5 4c, 4d, 4e, 4f, 4g, 4h, 4i, 4j, 4k, 41, 5a. 5b, 6a, 6b, 6c, 7a, 7b, 7c, 8a, 8b, 9a, 9b, 10a, or any other newly defined 
HCV (sub)type. It is to be understood that the amino adds constituting the epitope need not be part of a linear 
sequence, but may be interspersed by any number of amino acids, thus forming a conformational epitope. 

The term Immunogenic' refers to the ability of a substance to cause a humoral and/or cellular 
response, whether alone or when linked to a carrier, in the presence or absence of an adjuvant. 'Neutralization 1 
1 0 refers to an immune response that blocks the infecBvity, either partially or fully, of an infectious agent. A 'vaccine' 
is an immunogenic composition capable of eliciting protection against HCV, whether partial or complete. A 
vaccine may also be useful for treatment cf an individual, in which case it is called a therapeutic vaccine. 

The term 'therapeutic' refers to a composition capable of treating HCV infection. 

The term 'effective amount' refers to an amount of epitope-bearing polypeptide sufficient to induce an 
1 5 immunogenic response in the individual to which it is administered, or to otherwise detectably immunoreact in its 
intended system (e.g., immunoassay). Preferably, the effective amount is sufficient to effect treatment, as 
defined above. The exact amount necessar/ will vary according to the application. For vaccine applications or 
for the generation of polyclonal antiserum / antibodies, for example, the effective amount may vary depending on 
the species, age, and general condition of the individual, the severity of the condition being treated, the particular 
2 0 polypeptide selected and its mode of administration, etc. It is also believed that effective amounts will be found 
within a relatively large, non-critical range. An appropriate effective amount can be readily determined using only 
routine experimentation. Preferred ranges of El and/or E2 and/or E1/E2 single or specific oligomeric envelope 
proteins for prophylaxis of HCV disease are 0.01 to 100 jig/dose, preferably 0.1 to 50 ^g/dose. Several doses 
may be needed per individual in order to achieve a sufficient immune response and subsequent protection 

2 5 against HCV disease. 

Detailed description of the invention 

3 0 More particularly, the present invention contemplates a method for isolating or purifying recombinant 

HCV single or specific oligomeric envelope protein selected from the group consisting of E1 and/or E2 and/or 
E1/E2, characterized in that upon lysing the transformed host cells to isolate the recombinantly expressed 
protein a disulphide bond cleavage or reduction step is carried out with a disculphide bond cleaving agent. 

The essence of these 'single or specific oligomeric 1 envelope proteins of the invention is that they are 
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free from contaminating proteins and that they are not disulphide bond linked with contaminants. 

The proteins according to the present invention are recombinantly expressed in lower or higher 
eukafyotic cells or in prokaryotes. The recombinant proteins of the present invention are preferably glycosylated 
and may contain high-mannose-type, hybrid, or complex glycosylates. Preferentially said proteins are 
5 expressed from mammalian cell lines as discussed in detail in the Examples section, or in yeast such as in 
mutant yeast strains also as detailed in the Examples section. 

The proteins according to the present invention may be secreted or expressed within components of 
the ceil, such as the ER or the Golgi Apparatus. Preferably, however, the proteins of the present invention bear 
high-mannose-type glycosylates and are retained in the ER or Golgi Apparatus of mammalian cells or are 
1 0 retained in or secreted from yeast cells, preferably secreted from yeast mutant strains such as the mnn9 mutant 
(Kniskem et al M 1994), or from mutants that have been selected by means of vanadate resistence (Ballou et al., 
1991). 

Upon expression of HCV envelope proteins, the present inventors could show that some of the free 
thiol groups of cysteines not involved in intra- or inter-molecular disulphide bridges, react with cysteines of host 

15 or expression-system-derived (e.g. vaccinia) proteins or of other HCV envelope proteins (single or oligomeric), 
and form aspecific intermolecular bridges. This results in the formation of 'aggregates' of HCV envelope proteins 
together with contaminating proteins. It was also shown in WO 92/08734 that 'aggregates' were obtained after 
purification, but it was not described which protein interactions were involved. In patent application WO 
92/08734, recombinant E1/E2 protein expressed with the vaccinia virus system were partially purified as 

20 aggregates and only found to be 70% pure, rendering the purified aggregates not useful for diagnostic, 
prophylactic or therapeutic purposes. 

Therefore, a major aim of the present invention resides in the separation of single or specific-oiigomeric 
HCV envelope proteins from contaminating proteins, and to use the purified proteins (> 95% pure) for diagnostic, 
prophylactic and therapeutic purposes. To those purposes, the present inventors have been able to provide 

25 ' evidence that aggregated protein complexes ('aggregates') are formed on the basis of disulphide bridges and 
non-covalent protein-protein interactions. The present invention thus provides a means for selectively cleaving 
the disulphide bonds under specific conditions and for separating the cleaved proteins from contaminating 
proteins which greatly interfere with diagnostic, prophylactic and therapeutic applications. The free thiol groups 
may be blocked (reversibly or irreversibly) in order to prevent the reformation of disulphide bridges, or may be 

30 left to oxidize and oligomerize with other envelope proteins (see definition homo-oligomer). It is to be understood 
that such protein oligomers are essentially different from the 'aggregates' described in WO 92/08734 and WO 
94/01778, since the level of contaminating proteins is undetectable. 

Said disuphide bond cleavage may also be achieved by: 
(1) performic acid oxidation by means of cysteic acid in which case the cysteine residues are modified into 
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cysteic add (Moore et al. f 1 963). 

(2) Sulfitolysis (R-S-S-R - 2 R-SCh) for example by means of sulphite (SO 2 a) together with a proper oxidant 
such as Cu 2 * in which case the cysteine is modified into S-suIpho-cysteine (Bailey and Cole, 1959). 

(3) Reduction by means of mercaptans, such as dithiotrertoi (DDT), 8-mercapto-ethanol, cysteine, glutathione 
5 Red, s-mercapto-ethylamine, or thioglycollic acid, of which DTT and B-mercapto-ethanol are commonly used 

(Cleland, 1964), is the preferred method of this invention because the method can be performed in a water 
environment and because the cysteine remains unmodified. 

(4) Reduction by means of a phosphine (e.g. Su-P) (Ruegg and Rudinger, 1977). 

All these compounds are thus to be regarded as agents or means for cleaving disulphide bonds 
1 0 according to the present invention. 

Said disulphide bond cleavage (or reducing) step of the present invention is preferably a partial 
disulphide bond cleavage (reducing) step (carried out under partial cleavage or reducing conditions). 

A preferred disulphide bond cleavage or reducing agent according to the present invention is 
dithiothreitol (DTT). Partial reduction is obtained by using a low concentration of said reducing agent, i.e. for OTT 
15 for example in the concentration range of about 0.1 to about 50 mM, preferably about 0.1 to about 20 mM, 
preferably about 0.5 to about 10 mM, preferably more than 1 mM, more than 2 mM or more than 5 mM, more 
preferably about 1.5 mM, about 2.0 mM, about 2.5 mM. about 5 mM or about 7.5 mM. 

Said disulphide bond cleavage step may also be carried out in the presence of a suitable detergent (as 
an example of a means for cleaving disulphide bonds or in combination with a cleaving agent) able to dissociate 

2 0 the expressed proteins, such as DecylPEG, EMPIGEN-BB, NP-40, sodium cholate, Triton X-100. 

Said reduction or cleavage step (preferably a partial reduction or cleavage step) is carried out 
preferably in in the presence of (with) a detergent. A preferred detergent according to the present invention is 
Empigen-BB. The amount of detergent used is preferably in the range of 1 to 10 %, preferably more than 3%, 
more preferably about 3.5% of a detergent such as Empigen-BB. 
25 A particularly preferred method for obtaining disulphide bond cleavage employs a combination of a 

classical disulphide bond cleavage agent as detailed above and a detergent (also as detailed above). As 
contemplated in the Examples section, the particular combination of a low concentration of DTT (1.5 to 7.5 mM) 
and about 3.5 % of Empigen-BB is proven (o be a particularly preferred combination of reducing agent and 
detergent for the purification of recombinant^ expressed E1 and E2 proteins. Upon gelfOtration chromatography, 

3 0 said partial reduction is shown to result in the production of possibly dimeric E1 protein and separation of this E1 

protein from contaminating proteins that cause false reactivity upon use in immunoassays. 

It is, however, to be understood that also any other combination of any reducing agent known in the art 
with any detergent or other means known in the art to make the cysteines better accessible is also within the 
scope of the present invention, insofar as said combination reaches the same goal of disulphide bridge cleavage 
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as the preferred combination examplified in the present invention. 

Apart from reducing the disulphide bonds, a disulphide bond cleaving means according to the present 
invention may also include any disulphide bridge exchanging agents (competitive agent being either organic or 
proteinaeous, see for instance Creighton, 1988} known in the art which allows the following type of reaction to 
5 occur 

R1S-SR2 + R3SH -R1 S-SR3 + R2SH 
* R1, R2: compounds of protein aggregates 
' R3 SH: competitive agent (organic, proteinaeous) 
The term 'disulphide bridge exchanging agent 1 is to be interpretated as including disulphide bond 
1 0 reforming as well as disulphide' bond blocking agents. 

The present invention also relates to methods for purifying or isolating HCV single or specific oligomeric 
envelelope proteins as set out above further including the use of any SH group blocking or binding reagent 
known in the art such as chosen from the following list: 
Glutathion 

15 - 5,5'-dithiobis-(2-nitrobenzoic acid) or bis-(3-carboxy-4-nitrophenyl)-disulphide (DTNB or Biman's 
reagent) (Elmann, 1959) 
N-ethylmaleimide (NEM; Benesch et al., 1 956) 

N-(4-djmethylamino-3,5-dinitrophenyl) maleimide or Tupp/s maleimide which provides a color to the 
protein 

2 0 - P-chloromercuribenzoate (Grassetti et al., 1 969) 

4-vinyipyridine (Friedman and KruM, 1 969) can be liberated after reaction by acid hydrolysis 
acrylonitrile, can be liberated after reaction by acid hydrolysis (Weil and Seibles, 1 961 ) 
NEM-biotin (e.g. obtained from Sigma 81 267) 
2,2'-dithiopyridine (Grassetti and Murray, 1 967) 

2 5 - 4,4'-dithiopyridine (Grassetti and Murray, 1 967) 

6 f 6 '-dithiodin icon tinic acid (DTDNA; Brown and Cunnigham, 1 970) 

2,2'-dithiobis-(5 , -nitropyridine) (DTNP; US patent 3597160) or other dithiobis (heterocyclic derivative) 
compounds (Grassetti and Murray, 1969) 

A survey of the publications cited shows that often different reagents for sulphydry! groups will react 

3 0 with varying numbers of thiol groups of the same protein or enzyme molecule. One may conclude that this 

variation in reactivity of the thiol groups is due to the steric environment of these groups, such as the shape of 
the molecule and the surrounding groups of atoms and their charges, as well as to the size, shape and charge of 
the reagent molecule or ion. Frequently the presence of adequate concentrations of denaturants such as sodium 
dodecylsulfate, urea or guanidine hydrochloride will cause sufficient unfolding of the protein molecule to permit 
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equal access to ad of the reagents for thiol groups. By varying the concentration of denaturant, the degree of 
unfolding can be controlled and in this way thiol groups with different degrees of reactivity may be revealed. 
Although up to date most of the work reported has been done with p-chloromercuribenzoate, N-ethylmaleimide 
and DTNB, it is likely that the other more recently developed reagents may prove equally useful Because of 
5 their varying structures, it seems likely, in fact, that they may respond differently to changes in the steric 
environment of the thiol groups. 

Alternatively, conditions such as low pH (preferably lower than pH 6) for preventing free SH groups 
from oxidizing and thus preventing the formation of large intermolecular aggregates upon recombinant 
expression and purification of El and E2 (envelope) proteins are also within the scope of the present invention. 
10 A preferred SH group blocking reagent according to the present invention is N-ethylmaleimide (NEM). 

Said SH group blocking reagent may be administrated during lysis of the recombinant host cells and after the 
above-mentioned partial reduction process or after any other process for cleaving disulphide bridges. Said SH 
group blocking reagent may also be modified with any group capable of providing a detectable label and/or any 
group aiding in the immobilization of said recombinant protein to a solid substrate, e.g. biotinylated NEM. 
15 Methods for cleaving cysteine bridges and blocking free cysteines have also been described in Darbre 

(1987), Means and Feeney (1971), and by Wong (1993). 

A method to purify single or specific oligomeric recombinant El and/or E2 and/or E1/E2 proteins 
according to the present invention as defined above is funher characterized as comprising the following steps: 

lysing recombinant E1 and/or E2 and/or E1/E2 expressing host cells, preferably in the presence of an 
20 SH group blocking agent, such as N-ethylmaieimide (NEM), and possibly a suitable detergent, 

preferably Empigen-BB, 

recovering said HCV envelope protein by affinity purification for instance by means lectin- 
chromatography, such as lentil-lectin chromatography, or immunoaffinity chromatography using anti-El 
and/or anti-E2 specific monoclonal antibodies, followed by, 

2 5 - reduction or cleavage of disulphide bonds with a disulphide bond cleaving agent, such as DTT, 

preferably also in the presence of an SH group blocking agent, such as NEM or Biotin-NEM, and, 
recovering the reduced HCV E1 and/or E2 and/or E1/E2 envelope proteins for instance by gelfiltration 
(size exclusion chromatography or molecular sieving) and possibly also by an additional NP+-IMAC 
chromatography and desalting step. 

3 0 It is to be understood that the above-mentioned recovery steps may also be carried out using any other 

suitable technique known by the person skilled in the art. 

Preferred lectin-chromatography systems include Galanthus nivalis agglutinin (GNA) - 
chromatography, or Lens culinaris agglutinin (LCA) (lentil) lectin chromatography as illustrated in the Examples 
section. Other useful lectins include those recognizing high-mannose type sugars, such as Narcissus 
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pseudonarcissus agglutinin (NPA), Pisum sativum agglutinin (PSA), or Allium ursinum agglutinin (AUA). 

Preferably said method is usable to purify single or specific oligomeric HCV envelope protein produced 
intracellular^ as detailed above. 

For secreted El or E2 or E1/E2 oligomers, lectins binding complex sugars such as Ricinus communis 
5 agglutinin I (RCA I), are preferred lectins. 

The present invention more particularly contemplates essentially purified recombinant HCV single or 
specific oligomeric envelope proteins, selected from the group consisting of E1 and/or E2 and/or E1/E2, 
characterized as being isolated or purified by a method as denned above. 

The present invention more particularly relates to the purification or isolation of recombinant envelope 
1 0 proteins which are expressed from recombinant mammalian cells such as vaccinia. 

The present invention also relates to the purification or isolation of recombinant envelope proteins 
which are expressed from recombinant yeast calls. 

The present invention equally relates to the purification or isolation of recombinant envelope proteins 
which are expressed from recombinant bacterial (prokaryotic) cells. 
15 The present invention also contemplates a recombinant vector comprising a vector sequence, an 

appropriate prokaryotic. eukar/otic or viral or synthetic promoter sequence followed by a nucleotide sequence 
allowing the expression of the single or specific oligomeric El and/or E2 and/or E1/E2 of the invention. 

Particularly, the present invention contemplates a recombinant vector comprising a vector sequence, 
an appropriate prokaryotic, eukaryotic or viral or synthetic promoter sequence followed by a nucleotide 
2 0 sequence allowing the expression of the single E1 or El of the invention. 

Particularly, the present invention contemplates a recombinant vector comprising a vector sequence, 
an appropriate prokaryotic, eukar/otic or viral or synthetic promoter sequence followed by a nucleotide 
sequence allowing the expression of the single E1 or E2 of the invention. 

The segment of the HCV cDNA encoding the desired E1 and/or E2 sequence inserted into the vector 

2 5 sequence may be attached to a signal sequence. Said signal sequence may be that from a non-HCV source, 

e.g. the IgG or tissue plasminogen activator (tpa) leader sequence for expression in mammalian cells, or the a- 
mating factor sequence for expression into yeast cells, but particularly preferred constructs according to the 
present invention contain signal sequences appearing in the HCV genome before the respective start points of 
the E1 and E2 proteins. The segment of the HCV cDNA encoding the desired E1 and/or E2 sequence inserted 

3 0 into the vector may also include deletions e.g. of the hydrophobic domain(s) as illustrated in the examples 

section, or of the E2 hypervariable region I. 

More particularly, the recombinant vectors according to the present invention encompass a nucleic acid 
having an HCV cDNA segment encoding the polyprotein starting in the region between amino acid positions 1 
and 192 and ending in the region between positions 250 and 400 of the HCV polyprotein, more preferably 
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ending in the region between positions 250 and 341, even more preferably ending in the region between 
positions 290 and 341 for expression of the HCV single E1 protein. Most preferably, the present recombinant 
vector encompasses a recombinant nucleic acid having a HCV cONA seqment encoding part of the HCV 
polyprotein starting in the region between positions 117 and 192, and ending at any position in the region 
5 between positions 263 and 326, for expression of HCV single E1 protein. Also within the scope of the present 
invention are forms that have the first hydrophobic domain deleted (positions 264 to 293 plus or minus 8 amino 
acids), or forms to which a S'-terminal ATG codon and a 3'-terminal stop codon has been added, or forms which 
have a factor Xa cleavage site and/or 3 to 10, preferably 6 Histidine codons have been added. 

More particularly, the recombinant vectors according to the present invention encompass a nucleic acid 

1 0 having an HCV cDNA segment encoding the polyprotein starting in the region between amino acid positions 290 
and 406 and ending in the region between positions 600 and 820 of the HCV polyprotein, more preferably 
starting in the region between positions 322 and 406, even more preferably starting in the region between 
positions 347 and 406, even still more preferably starting in the region between positions 364 and 406 for 
expression of the HCV single E2 protein. Most preferably, the present recombinant vector encompasses a 

15 recombinant nucleic acid having a HCV cDNA seqment encoding the polyprotein starting in the region between 
positions 290 and 406, and ending at any position of positions 623, 650. 661, 573. 710, 715, 720, 746 or 809, for 
expression of HCV single E2 protein. Also within the scope of the present invention are forms to which a S'- 
terminal ATG codon and a 3'-terminal stop codon has been added, or forms which have a factor Xa cleavage 
site and/or 3 to 1 0, preferably 6 Histidine codons have been added. 

20 A variety of vectors may be used to obtain recombinant expression of HCV single or specific oligomeric 

envelope proteins of the present invention. Lower eukaryotes such as yeasts and glycosylate mutant strains 
are typically transformed with plasmids, or are transformed with a recombinant virus. The vectors may replicate 
within the host independently, or may integrate into the host cell genome. 

Higher eukaryotes may be transformed with vectors, or may be infected with a recombinant virus, for 

2 5 example a recombinant vaccinia virus. Techniques and vectors for the insertion of foreign DNA into vaccinia 

virus are well known in the art, and utilize, for example homologous recombination. A wide variety of viral 
promoter sequences, possibly terminator sequences and poly(A)-addition sequences, possibly enhancer 
sequences and possibly amplification sequences, all required for the mammalian expression, are available in the 
art. Vaccinia is particularly preferred since vaccinia halts the expression of host cell proteins. Vaccinia is also 

3 0 very much preferred since it allows the expression of E1 and E2 proteins of HCV in cells or individuals which are 

immunized with the live recombinant vaccinia virus. For vaccination of humans the avipox and Ankara Modified 
Virus (AMV) are particularly useful vectors. 

Also known are insect expression transfer vectors derived from baculovirus Autoarapha califomica 
nuclear polyhedrosis virus (AcNPV), which is a helper-independent viral expression vector. Expression vectors 
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derived from this system usually use the strong viral polyhedrin gene promoter to drive the expression of 
heterologous genes. Different vectors as well as methods for the introduction of heterologous DNA into the 
desired site of baculovirus are available to the man skilled in the art for baculovirtis expression. Also different 
signals for posttranslationaJ modification recognized by insect cells are known in the art. 
5 Also included within the scope of the present invention is a method for producing purified recombinant 

single or specific oligomeric HCV E1 or £2 or E1.E2 proteins, wherein the cysteine residues involved in 
aggregates formation are replaced at the level of the nucleic acid sequence by other residues such that 
aggregate formation is prevented. The recombinant proteins expressed by recombinant vectors caarying such a 
mutated E1 and/or E2 protein encoding nucleic acid are also within the scope of the present invention. 

10 The present invention also relates to recombinant E1 and/or E2 and/or E1/E2 proteins characterized in 

that at least one of their glycosylate sites has been removed and are consequently termed glycosylate 
mutants. As explained in the Examples section, different glycosylate mutants may be desired to diagnose 
(screening, confirmation, prognosis, etc.) and prevent HCV disease according to the patient in question. An E2 
protein glycosylate mutant lacking the GLY4 has : cr instance been found to improve the reactivity of certain 

15 sera in diagnosis. These glycosylate mutants are preferably purified according to the method disclosed in the 
present invention. Also contemplated within the present invention are recombinant vectors carrying the nucleic 
acid insert encoding such a E1 and/or E2 andfcr E;,E2 glycosylate mutant as well as host cells tranformed 
with such a recombinant vector. 

The present invention also relates to recomcinant vectors including a polynucleotide which also forms 

2 0 part of the present invention. The present invention relates more particularly to the recombinant nucleic acids as 

represented in SEQ ID NO 3, 5, 7, 9, 11, 13, 21. 23, 25. 27, 29, 31, 35, 37, 39, 41, 43, 45, 47 and 49, or parts 
thereof. 

The present invention also contemplates host cells transformed with a recombinant vector as defined 
above, wherein said vector comprises a nucleotide sequence encoding HCV E1 and/or E2 and/or E1/E2 protein 
25 as defined above in addition to a regulatory sequence operably linked to said HCV E1 and/or E2 and/or E1/E2 
sequence and capable of regulating the expression cf said HCV E1 and/or E2 and/or E1/E2 protein. 

Eukaryotic hosts include lower and higher eukaryotic hosts as described in the definitions section. 
Lower eukaryotic hosts include yeast cells well known in Ihe art. Higher eukaryotic hosts mainly include 
mammalian cell lines known in the art and include many immortalized cell lines available from the ATCC, 

3 0 , iniuding HeLa cells, Chinese hamster ovary (CHO) cells. Baby hamster kidney (BHK) cells, PK15, RK13 and a 

number of other ceil lines. 

The present invention relates particularly to a recombinant E1 and/or E2 and/or E1/E2 protein 
expressed by a host cell as defined above containing a recombinany vector as defined above. These 
recombinant proteins are particularly purified according to the method of the present invention. 
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A preferred method for isolating or purifying HCV envelope proteins as defined above is further 
characterized as comprising at least the following steps: 

growing a host eel! as defined above transformed with a recombinant vector according to the present 
Invention or with a known recombinant vector expressing E1 and/or E2 and/or E1/E2 HCV envelope 
5 proteins in a suitable culture medium, 

causing expression of said vector sequence as defined above under suitable conditions, and. 

lysing said transformed host cells, preferably in the presence of a SH group blocking agent, such as N- 

ethylmaleimide (NEM), and possibly a suitable detergent, preferably Empigen-BB, 

recovering said HCV envelope protein by affinity purification such as by means of lectin- 

10 "chromatography or immunoaffinity chromatography using anti-E1 and/or anti-E2 specific monoclonal 

antibodies, with said lectin being preferably lentii-Iectin or GNA, followed by, 
incubation of the eluate of the previous step with a disulphide bond cleavage means, such as DTT, 
preferably followed by incubation with an SH group blocking agent, such as NEM or Biotin-NEM, and, 
isolating the HCV single or specific oligomeric E1 and/or E2 and/or E1/E2 proteins such as by means 

15 of gelfiltration and possibly also by a subsequent NP*-IMAC chromatography followed by a desalting 

step. 

As a result of the above-mentioned proces, E1 and/or E2 and/or E1/E2 proteins may be produced in a 
form which eiute differently from the large aggregates containing vector-derived components and/or cell 
components in the void volume of the gelfiltration column or the IMAC collumn as illustrated in the Examples 
20 section. The disulphide bridge cleavage step advantageously also eliminates the false reactivity due to the 
presence of host and/or expression-system-derived proteins. The presence of NEM and a suitable detergent 
during lysis of the cells may already partly or even completely prevent the aggregation between the HCV 
envelope proteins and contaminants. 

NF+-IMAC chromatography followed by a desalting step is preferably used for contructs bearing a (His)e 
25 as described by Janknecht et ai., 1991 , and Hochuli et al., 1988. 

The present invention also relates to a method for producing monoclonal antibodies in small animals 
such as mice or rats, as well as a method for screening and isolating human B-cells that recognize anti-HCV 
antibodies, using the HCV single or specific oligomeric envelope proteins of the present invention. 

The present invention further relates to a composition comprising at least one of the following E1 
3 0 peptides as listed in Table 3: 

E1-31 (SEQ ID NO 56) spanning amino acids 181 to 200 of the Core/E1 V1 region, 
E1 -33 (SEQ ID NO 57) spanning amino acids 1 93 to 21 2 of the E1 region, 
E1-35 (SEQ ID NO 58) spanning amino acids 205 to 224 of the E1 V2 region (epitope B), 
E1-35A (SEQ ID NO 59) spanning amino acids 208 to 227 of the E1 V2 region (epitope B). 
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1bE1 (SEQ ID NO 53) spanning amino acids 192 to 228 of El regions (V1, C1, and V2 regions 
(containing epitope B)), 

E1-51 (SEQ ID NO 66) spanning amino acids 301 to 320 of the E1 region, 
E1-53 (SEQ ID NO 67) spanning amino acids 313 to 332 of the E1 C4 region (epitope A), 
5 E1 -55 (SEQ ID NO 68) spanning amino acids 325 to 344 of the E1 region. 

The present invention also relates to a composition comprising at least one of the following E2 peptides 
as listed in Table 3: 

Env 67 or E2-67 (SEQ ID NO 72) spanning amino acid positions 397 to 416 of the E2 region (epitope 
A, recognized by monoclonal antibody 2F10H10. see Figure 19), 
1 0 Env 69 or E2-69 (SEQ ID NO 73) spanning amino acid positions 409 to 426 of the E2 region (epitope 

A), 

Env 23 or E2-23 (SEQ ID NO 85) spanning positions 583 to 602 of the E2 region (epitope E), 
Env 25 or E2-25 (SEQ ID NO 87) spanning positions 595 to 614 of the E2 region (epitope E), 
Env 27 or E2-27 (SEQ ID NO 88) spanning positions 507 to 626 of the E2 region (epitope E), 
15 Env 17B or E2-1 78 (SEQ ID NO 83) spanning positions 547 to 566 of the E2 region (epitope D), 

Env 138 or E2-13B (SEQ ID NO 82) spanning positions 523 to 542 of the E2 region (epitope C; 
recognized by monoclonal antibody 16A6E7, sea Figure 19). 

The present invention also relates to a composition comprising at least one of the following E2 
conformational epitopes: 

20 epitope F recognized by monoclonal antibodies 15C8C1, 12D11F1 and8G10D1H9, 

epitope G recognized by monoclonal antibody 9G3E5, 
epitope H (or C) recognized by monoclonal antibody 10D3C4 and 4H632, or, 
epitope I recognized by monoclonal antibody 17F2C2. 

The present invention also relates to an E1 or £2 specific antibody raised upon immunization with a 
25 peptide or protein composition, with said antibody being specifically reactive with any of the polypeptides or 
peptides as defined above, and with said antibody being preferably a monoclonal antibody. 

The present invention also relates to an E1 or E2 specific antibody screened from a variable chain 
library in plasmids or phages or from a population of human B-cells by means of a process known in the art, with 
said antibody being reactive with any of the polypeptides or peptides as defined above, and with said antibody 
3 0 being preferably a monoclonal antibody. 

The E1 or E2 specific monoclonal antibodies of the invention can be produced by any hybridoma liable 
to be formed according to classical methods from splenic cells of an animal, particularly from a mouse or rat. 
immunized against the HCV polypeptides or peptides according to the invention, as defined above on the one 
hand, and of cells of a myeloma cell line on the other hand, and to be selected by the ability of the hybridoma to 
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produce the monoclonal antibodies recognizing the polypeptides which has been initially used for the 
immunization of the animals. 

The antibodies involved in the invention can be labelled by an appropriate label of the enzymatic, 
fluorescent, or radioactive type. 
5 The monoclonal antibodies according to this preferred embodiment of the invention may be humanized 

versions of mouse monoclonal antibodies made by means of recombinant DNA technology, departing from pans 
of mouse and/or human genomic DNA sequences coding for H and L chains from cONA or genomic clones 
coding for H and L chains. 

Alternatively the monoclonal antibodies according to this preferred embodiment of the invention may be 
1 0 human monoclonal antibodies. These antibodies according to the present embodiment of the invention can also 
be derived from human peripheral blood lymphocytes of patients infected with HCV, or vaccinated against HCV. 
Such human monoclonal antibodies are prepared, for instance, by means of human peripheral biocd 
lymphocytes (PBL) repopulation of severe combined immune deficiency (SCID) mice (for recent review., see 
Duchosal et al., 1992). 

15 Th e invention also relates to the use of the proteins or peptides of the invention, for the selection of 

recombinant antibodies by the process of repertoire cloning (Persson et al M 1991). 

Antibodies directed to peptides or single or specific oligomeric envelope proteins derived from a certain 
genotype may be used as a medicament, more particularly for incorporation into an immunoassay for the 
detection of HCV genotypes (for detecting the presence of HCV E1 or E2 antigen), for prognosing/monitoring of 

2 0 HCV disease, or as therapeutic agents. 

Alternatively, the present invention also relates to the use of any of the above-specified E1 or E2 
specific monoclonal antibodies for the preparation of an immunoassay kit for detecting the presence of E1 or E2 
antigen in a biological sample, for the preparation of a kit for prognosing/monitoring of HCV disease or for the 
preparation of a HCV medicament. 
25 The present invention also relates to the a method for in vitro diagnosis or detection of HCV antigen 

present in a biological sample, comprising at least the following steps : 

(i) contacting said biological sample with any of the E1 and/or E2 specific monoclonal antibodies 
as defined above, preferably in an immobilized form under appropriate conditions which allow 
the formation of an immune complex, 

3 0 (ii) removing unbound components, 

(Hi) incubating the immune complexes formed with heterologous antibodies, which specifically 
bind to the antibodies present in the sample to be analyzed, with said heterologous antibodies 
having conjugated to a detectable label under appropriate conditions, 

(iv) detecting the presence of said immune complexes visually or mechanically (e.g. by means of 
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densitometry, fluorimetry, coiorimetry). 
The present invention also relates to a kit for in vitro diagnosis of HCV antigen presem ;n a biological 
sample, comprising: 

at least one monoclonal antibody as defined above, wfh said antibody being preferentially 
5 immobilized on a solid substrate, 

a buffer or components necessary for producing the buffer enabling binding reaction between 
these antibodies and the HCV antigens present in the biological sample, 
a means for detecting the immune complexes formed in the preceding binding reaction, 
possibly also including an automated scanning and interpretation device for inferring the HCV 
1 0 antigens present in the sample from the observed binding pattern. 

The present invention also relates to a composition comprising E1 and/or E2 and/or E1/E2 recombinant 
HCV proteins purified according to the method of the present invention or a composition comprising at least one 
peptides as specified above for use as a medicament. 

The present invention more particulariy relates to a composition comprising at least one of the above- 
15 specified envelope peptides or a recombinant envelope protein composition as defined above, for use as a 
vaccine for immunizing a mammal, preferably humans, against HCV ; comprising administering a sufficient 
amount of the composition possibly accompanied by pharmaceuiically acceptable adjuvant(s), to produce an 
immune response. 

More particulariy, the present invention relates to the use of any of the compositions as described here 
2 0 above for the preparation of a vaccine as described above. 

Also, the present invention relates to a vaccine composition for immunizing a mammal, preferably 
humans, against HCV, comprising HCV single or specific oligomeric proteins or peptides derived from the E1 
and/or the E2 region as described above. 

Immunogenic compositions can be prepared according to methods known in the art. The present 
2 5 compositions comprise an immunogenic amount of a recombinant E1 and/or E2 and/or E1/E2 single or specific 
oligomeric proteins as defined above or E1 or E2 peptides as defined above, usually combined with a 
pharmaceutically acceptable carrier, preferably further comprising an adjuvant. 

The single or specific oligomeric envelope proteins of the present invention, either E1 and/or E2 and/or 
E1/E2, are expected to provide a particulariy useful vaccine antigen, since the formation of antibodies to either 
30 E1 or E2 may be more desirable than to the other envelope protein, and since the E2 protein is cross-reactive 
between HCV types and the E1 protein is type-specific. Cocktails including type 1 E2 protein and E1 proteins 
derived from several genotypes may be particulariy advantageous. Cocktails containing a molar excess of E1 
versus E2 or E2 versus E1 may also be particularly useful. Immunogenic compositions may be administered to 
animals to induce production of antibodies, either to provide a source of antibodies or to induce protective 
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immunity in the animai. 

Pharmaceutical^ acceptable carriers include any carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macrcmolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric 
5 amino acids, amino acid copolymers; and inactive virus particles. Such carriers are well known to those of 
ordinary skill in the art. 

Preferred adjuvants to enhance effectiveness of the composition include, but are not limited to : 
aiuminim hydroxide (alum), N-acetyi-muramyl-L-threonyl-DHSoglutamine (thr-MDP) as found in U.S. Patent No. 
4,606,918, N-acetyl-normuramyl-L-alanyl-D-iscglutamine (nor-MDP), N-acetylmuramyl-L-alanyl-OHSoglutaminyl- 
10 L-alanine-2-(1'-2'-dipaIm^ (MTP-PE) and RIBI, which 

contains three components extracted from bacteria, monophosphoryl lipid A, trehalose dimycolate, and cell wall 
skeleton (MPUTOM+CWS) in a 2% squaienaTween 80 emulsion. Any of the 2 components MPL, TDM or 
CWS may also be used alone or combined 2 by 2. Additionally, adjuvants such as Stimulon (Cambridge 
Bioscience, Worcester, MA) or SAF-1 (Syntax) may be used. Further, Complete Freund's Adjuvant (CFA) and 
1 5 Incomplete Freund's Adjuvant (IFA) may be used for non-human applications and research purposes. 

The immunogenic compositions typically will contain pharmaceutical^ acceptable vehicles, such as 
water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH 
buffering substances, preservatives, and the like, may be included in such vehicles. 

Typically, the immunogenic compositions are prepared as injectables, either as liquid solutions or 
2 0 suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection may also be 
prepared. The preparation also may be emulsified or encapsulated in liposomes for enhanced adjuvant effect. 
The E1 and E2 proteins may also be incorporated into Immune Stimulating Complexes together with saponins, 
for example Quil A (iSCOMS). 

Immunogenic compositions used as vaccines comprise a 'sufficient amount 1 or 'an immunologically 

2 5 effective amount 1 of the envelope proteins of the present invention, as well as any other of the above mentioned 

components, as needed. 'Immunologically effective amount 1 , means that the administration of that amount to an 
individual, either in a single dose or as part of a series, is effective for treatment, as defined above. This amount 
varies depending upon the health and physical condition of the individual to be treated, the taxonomic group of 
individual to be treated (e.g. nonhuman primate, primate, etc.), the capacity of the individual's immune system to 

3 0 synthesize antibodies, the degree of protection desired, the formulation of the vaccine, the treating doctor's 

assessment of the medical situation, the strain of infecting HCV, and other relevant factors. It is expected that 
the amount will fall in a relatively broad range that can be determined through routine trials. Usually, the amount 
will vary from 0.01 to 1 000 jig/dose, more particularly from 0.1 to 100 t ug/dose. 

The single or specific oligomeric envelope proteins may also serve as vaccine carriers to present 
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homologous (e.g. T cell epitopes or B cell epitopes from the core, NS2 t NS3, NS4 or NS5 regions) or 
heterologous (non-HCV) haptens, in the same manner as Hepatitis B surface antigen (see European Patent 
Application 174,444). In this use, envelope proteins provide an immunogenic carrier capable of stimulating an 
immune response to haptens or antigens conjugated to the aggregate. The antigen may be conjugated either by 
5 conventional chemical methods, or may be cloned into the gene encoding El and/or E2 at a location 
corresponding to a hydrophilic region of the protein. Such hydrophyiic regions include the V1 region 
(encompassing amino acid positions 191 to 202), the V2 region (encompassing amino acid positions 213 to 
223), the V3 region (encompassing amino acid positions 230 to 242), the V4 region (encompassing amino acid 
positions 230 to 242), the V5 region (encompassing amino acid positions 294 to 303) and the V6 region 
0 (encompassing amino acid positions 329 to 336). Another useful location for insertion of haptens is the 
hydrophobic region (encompassing approximately amino acid positions 264 to 293). It is shown in the present 
invention that this region can be deleted without affecting the reactivity of the deleted E1 protein with antisera. 
Therefore, haptens may be inserted at the site of the deletion. 

The immunogenic compositions are conventionally administered parenterally, typically by injection, for 
5 example, subcutaneously or intramuscularly. Additional formulations suitable for other methods of administration 
include oral formulations and suppositories. Dosage treatment may be a single dose schedule or a multiple dose 
schedule. The vaccine may be administered in conjunction with other immunoregulatory agents. 

The present invention also relates to a composition comprising peptides or polypeptides as described 
above, for in vitro detection of HCV antibodies present in a biological sample. 
3 The present invention also relates to the use of a composition as described above for the preparation 

of an immunoassay kit for detecting HCV antibodies present in a biological sample. 

The present invention also relates to a method for in vitm diagnosis of HCV antibodies present in a 
biological sample, comprising at least the following steps : 

(i) contacting said biological sample with a composition comprising any of the envelope peptide 
> or proteins as defined above, preferably in an immobilized form under appropriate conditions 

which allow the formation of an immune complex, wherein said peptide or protein can be a 
biotinylated peptide or protein which is covalently bound to a solid substrate by means of 
streptavidin or avidin complexes, 
00 removing unbound components, 

(iii) incubating the immune complexes formed with heterologous antibodies, with said 
heterologous antibodies having conjugated to a detectable label under appropriate conditions, 

(iv) detecting the presence of said immune complexes visually or mechanically (e.g. by means of 
densitometry, fluorimetry, colorimetry). 

Alternatively, the present invention also relates to competition immunoassay formats in which 
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recombinant^ produced purified single or specific oligomeric protein E1 and/or E2 and/or E1/E2 proteins as 
disclosed above are used in combination with El and/or E2 peptides in order to compete for HCV antibodies 
present in a biological sample. 

The present invention also relates to a kit for determining the presence of HCV antibodies, in a 
5 biological sample, comprising : 

at least one peptide or protein composition as defined above, possibly in combination with 
other polypeptides or peptides from HCV or other types of HCV, with said peptides or proteins 
being preferentially immobilized on a solid substrate, more preferably on different microweils 
of the same ELISA plate, and even more preferentially on one and the same membrane strip, 
10 a buffer or components necessary for producing the buffer enabling binding reaction between 

these polypeptides or peptides and the antibodies against HCV present in the biological 
sample, 

means for delecting the immune complexes formed in the preceding binding reaction, 
possibly also including an automated scanning and interpretation device for inferring the HCV 
1 5 genotypes present in the sample from the observed binding pattern. 

The immunoassay methods according to the present invention utilize single or specific oligomeric 
antigens from the E1 and/or E2 domains that maintain linear (in case of peptides) and conformational epitopes 
(single or specific oligomeric proteins) recognized by antibodies in the sera from individuals infected with HCV. It 
is within the scope of the invention to use for instance single or specific oligomeric antigens, dimeric antigens, as 
20 well as combinations of single or specific oligomeric antigens. The HCV E1 and E2 antigens of the present 
invention may be employed in virtually any assay format that employs a known antigen to detect antibodies. Of 
course, a format that denatures the HCV conformational epitope should be avoided or adapted. A common 
feature of ail of these assays is that the antigen is contacted with the body component suspected of containing 
HCV antibodies under conditions that permit the antigen to bind to any such antibody present in the component. 

2 5 .Such conditions will typically be physiologic temperature, pH and ionic strenght using an excess of antigen. The 

incubation of the antigen with the specimen is followed by detection of immune complexes comprised of the 
antigen. 

Design of the immunoassays is subject to a great deal of variation, and many formats are known in the 
art. Protocols may, for example, use solid supports, or immunoprecipitation. Most assays involve the use of 

3 0 labeled antibody or polypeptide; the labels may be, for example, enzymatic, fluorescent, -chemiluminescent, 

radioactive, or dye molecules. Assays which amplify the signals from the immune complex are also known; 
examples of which are assays which utilize biotin and avidin or streptavidin, and enzyme-labeled and mediated 
immunoassays, such as ELISA assays. 

The immunoassay may be, without limitation, in a heterogeneous or in a homogeneous format, and of 
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a standard or competitive type. In a heterogeneous format, the polypeptide is typically bound to a solid matrix or 
support to facilitate separation of the sample from the polypeptide after incubation. Examples of solid supports 
that can be used are nitrocellulose (e.g., in membrane or microliter well form), polyvinyl chloride (e.g., in sheets 
or microliter wells), polystyrene latex (e.g., in beads or microliter plates, poiyvinyiidine fluoride (known as 
5 Immunolon™), diazotized paper, nylon membranes, activated beads, and Protein A beads. For example, 
Dynatech Immunolon™ 1 or Immunlon™ 2 microliter plates or 0.25 inch polystyrene beads (Precision Plastic 
Ball) can be used in the heterogeneous format. The solid support containing the antigenic polypeptides is 
typically washed after separating it from the test sample, and prior to detection of bound antibodies. Both 
standard and competitive formats are know in the art 
10 ,n a homogeneous format, the test sample is incubated with the combination of antigens in solution. 

For example, it may be under conditions that will precipitate any antigen-antibody complexes which are formed. 
Both standard and competitive formats for these assays are known in the art. 

In a standard format, the amount of HCV antibodies in the antibody-antigen complexes is directly 
monitored. This may be accomplished by determining whether labeled anti-xenogeneic (e.g. anti-human) 
15 antibodies which recognize an epitope on anti-HCV antibodies will bind due to complex formation. In a 
competitive format, the amount of HCV antibodies in the sample is deduced by monitoring the competitive effect 
on the binding of a known amcum of labeled antibody (or other competing ligand) in the complex. 

Complexes formed comprising anti-HCV antibody (or in the case of competitive assays, the amount of 
competing antibody) are detected by any of a number of known techniques, depending on the format. For 
20 example, unlabeled HCV antibodies in the complex may be detected using a conjugate of anti-xenogeneic Ig 
complexed with a label (e.g. an enzyme label). 

In an immunoprecfpitaticn or agglutination assay format the reaction between the HCV antigens and 
the antibody forms a network that precipitates from the solution or suspension and forms a visible layer or film of 
precipitate. If no anti-HCV antibody is present in the test specimen, no visible precipitate is formed. 
2 5 "^ere currently exist three specific types of particle agglutination (PA) assays. These assays are used 

for the detection of antibodies to various antigens when coated to a support. One type of this assay is the 
hemagglutination assay using red blood cells (RBCs) that are sensitized by passively adsorbing antigen (or 
antibody) to the RBC. The addition of specific antigen antibodies present in the body component, if any, causes 
the RBCs coated with the purified antigen to agglutinate. 
30 To eliminate potential non-specific reactions in the hemagglutination assay, two artificial carriers may 

be used instead of RBC in the PA. The most common of these are latex particles. However, gelatin particles 
may also be used. The assays utilizing either of these carriers are based on passive agglutination of the 
particles coated with purified antigens. 

The HCV single or specififc oligomeric E1 and/or E2 and/or E1/E2 antigens of the present invention 
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comprised of conformational epitopes will typically be packaged in the form of a kit for use in these 
immunoassays. The kit will normally contain in separate containers the native HCV antigen, control antibody 
formulations (positive and/or negative), labeled antibody when the assay format requires the same and signal 
generating reagents (e.g. enzyme substrate) if the label does not generate a signal directly. The native HCV 
5 antigen may be already bound to a solid matrix or separate with reagents for binding it to the matrix. Instructions 
(e.g. written, tape, CD-ROM, etc.) for carrying out the assay usually will be included in the kit 

Immunoassays that utilize the native HCV antigen are useful in screening blood for the preparation of a 
supply from which potentially infective HCV is lacking. The method for the preparation of the blood supply 
comprises the following steps. Reacting a body component, preferably blood or a blood component, from the 

10 individual donating blood with HCV E1 and/or E2 proteins of the present invention to allow an immunological 
reaction between HCV antibodies, if any, and the HCV antigen. (Detecting whether anti-HCV antibody - HCV 
antigen complexes are formed as a result of the reacting. Blood contributed to the blood supply is from donors 
that do not exhibit antibodies to the native HCV antigens, E1 or EZ 

In cases of a positive reactivity to the HCV antigen, it is preferable to repeat the immunoassay to lessen 

15 the possibility of false positives. For example, in the large scale screening of blood for the production of blood 
products (e.g. blood transfusion, plasma, Factor VIII, immunoglobulin, etc.) 'screening' tests are typically 
formatted to increase sensitivity (to insure no contaminated blood passes) at the expense of specificity; i.e. the 
false-positive rate is increased. Thus, it is typical to only defer for further testing those donors who are 
'repeatedly reactive'; Le. positive in two or more runs of the immunoassay on the donated sample. However, for 

2 0 confirmation of HCV-positivity, the 'confirmation' tests are typically formatted to increase specificity (to insure that 
no false-positive samples are confirmed) at the expense of sensitivity. Therefore the purification method 
described in the present invention for E1 and E2 will be very advantageous for including singie or specific 
oligomeric envelope proteins into HCV diagnostic assays. 

The solid phase selected can include polymeric or glass beads, nitrocellulose, microparticles, 

2 5 microwells of a reaction tray, test tubes and magnetic beads. "The signal generating compound can include an 

enzyme, a luminescent compound, a chromogen, a radioactive element and a chemiluminescent compound. 
Examples of enzymes include alkaline phosphatase, horseradish peroxidase and beia-galactosidase. Examples 
of enhancer compounds include biotin, anti-biotin and avidin. Examples of enhancer compounds binding 
members include biotin. anti-biotin and avidin. In order to block the effects of rheumatoid factor-like substances, 

3 0 the test sample is subjected to conditions sufficient to block the effect of rheumatoid factor-like substances. 

These conditions comprise contacting the test sample with a quantity of anti-human IgG to form a mixture, and 
incubating the mixture for a time and under conditions sufficient to form a reaction mixture product substantially 
free of rheumatoid factor-like substance. 

The present invention further contemplates the use of E1 proteins, or parts thereof, more particularly 
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HCV single or specific oligomeric El proteins as defined above, for in vitro monitoring HCV disease or 
prognosing the response to treatment (for instance with interferon) of patients suffering from HCV infection 
comprising: 

incubating a biological sample from a patient with hepatitis C infection with an E1 protein or a 
5 suitable part thereof under conditions allowing the formation of an immunological complex, 

removing unbound components, 

calculating the anti-E1 titers present in said sample (for example at the start of and/or during 
the course of (interferon) therapy), 

monitoring the natural course of HCV disease, or prognosing the response to treatment of 
10 said patient on the basis of the amount anti-El titers found in said sample at the start of 

treatment and/or during the course of treatment. 
Patients who show a decrease of 2, 3, 4 t 5, 7, 10, 15, or preferably more than 20 times of the initial 
anti-El titers could be concluded to be long-term, sustained responders to HCV therapy, more particularly to 
interferon therapy. It is illustrated in the Examples section, that an anti-E1 assay may be very useful for 
15 prognosing long-term response to IFN treatment, or to treatment of Hepatitis C virus disease in general. 

More particularly the following E1 peptides as listed in Table 3 were found to be useful for in vitro 
monitoring HCV disease or prognosing the response to interferon treatment of patients suffering from HCV 
infection: 

E1 -31 (SEQ ID NO 56) spanning amino acids 181 to 200 of the Core/El V1 region, 
20 E1-33 (SEQ ID NO 57) spanning amino acids 193 to 212 of the E1 region, 

E1 -35 (SEQ ID NO 58) spanning amino acids 205 to 224 of the E1 V2 region (epitope B), 
E1-35A (SEQ ID NO 59) spanning amino acids 208 to 227 of the E1 V2 region (epitope B), 
1bE1 (SEQ ID NO 53) spanning amino acids 192 to 228 of E1 regions (V1, C1, and V2 regions 
(containing epitope B)), 
25 E1 -51 (SEQ ID NO 66) spanning amino acids 301 to 320 of the E 1 region, 

E1-53 (SEQ ID NO 67) spanning amino acids 313 to 332 of the E1 C4 region (epitope A), 
E1 -55 (SEQ ID NO 68) spanning amino acids 325 to 344 of the El region, 
it is to be understood that smaller fragments of the above-mentioned peptides also fall within the scope 
of the present invention. Said smaller fragments can be easily prepared by chemical synthesis and can be 
3 0 ^ tested for their ability to be used in an assay as detailed above and in the Examples section. 

The present invention also relates to a kit (or monitoring HCV disease or prognosing the response to 
treatment (for instance to interferon) of patients suffering from HCV infection comprising: 

at least one E1 protein or E1 peptide, more particularly an El protein or E1 peptide as denned 
above, 
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a buffer or components necessary for producing the buffer enabling the binding reaction 
between these proteins or peptides and the anti-E1 antibodies present in a biological sample, 
means for detecting the immune complexes formed in the preceding binding reaction, 
possibly also an automated scanning and interpretation device for inferring a decrease of anti- 
5 E1 titers during the progression of treatment 

It is to be understood that also E2 protein and peptides according to the present invention can be used 
to a certain degree to monitor/prognose HCV treatment as indicated above for the E1 proteins or peptides 
because also the anti-E2 levels decrease in comparison to antibodies to the other HCV antigens. It is to be 
understood, however, that it might be possible to determine certain epitopes in the E2 region which would also 
10 be suited for use in an test for monitoring/prognosing HCV disease. 

The present invention also relates to a serotyping assay for detecting one or more serological types of 
HCV present in a biological sample, more particularly for detecting antibodies of the different types of HCV to be 
detected combined in one assay format, comprising at least the following steps : 

(i) contacting the biological sample to be analyzed for the presence of HCV antibodies of one or 
15 more serological types, with at least one of the E1 and/or E2 andfor E1/E2 protein 

compositions or at least one of the E1 or E2 peptide compositions as defined above, 
preferantiaily in an immobilized form under appropriate conditions which allow the formation of 
an immune complex, 

(ii) removing unbound components. 

20 (iii) incubating the immune, complexes formed with heterologous antibodies, with said 

heterologous antibodies being conjugated to a detectable label under appropriate conditions, 
(iv) detecting the presence of said immune complexes visually or mechanically (e.g. by means of 
densitometry, fluorimetry, coiorimetry) and inferring the presence of one or more HCV 
serological types present from the observed binding pattern. 
25 It is to be understood that the compositions of proteins or peptides used in this method are 

recombinant^ expressed type-specific envelope proteins or type-specific peptides. 

The present invention further relates to a kit for serotyping one or more serological types of HCV 
present in a biological sample, more particularly for detecting the antibodies to these serological types of HCV 
comprising: 

30 at least one E1 and/or E2 and/or E1/E2 protein or El or E2 peptide, as defined above, 

a buffer or components necessary for producing the buffer enabling the binding reaction 
between these proteins or peptides and the anti-E1 antibodies present in a biological sample, 
means for detecting the immune complexes formed in the preceding binding reaction, 
possibly also an automated scanning and interpretation device for detecting the presence of 
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one or more serological types present from the observed binding pattern. 
The present invention also relates to the use of a peptide or protein composition as defined above, for 
immobilization on a solid substrate and incorporation into a reversed phase hybridization assay, preferably for 
immobilization as parallel lines onto a solid support such as a membrane strip, for determining the presence or 
5 the genotype of HCV according to a method as defined above. Combination with other type-specific antigens 
from other HCV polyprotein regions also lies within the scope of the present invention. 

The present invention provides a method for purifying recombinant HCV single or specific oligometic envelope 
proteins selected from El and/or E2 and/or E1/E2 proteins which have been produced by a recombinant process 
comprising contacting said proteins with a disulphide bond cleavage or reducing agent. The contacting of the method of 
10 the invention may be carried out under partial cleavage or reducing conditions. Preferably, the disulphide bond cleavage 
agent is dithiothreitoi (DTT), preferably in a concentration range of 0. 1 to 50 mM, preferably 0. 1 to 20 rnM, more 
preferably 0.5 to 10 mM. Alternatively, the disulphide bond cleavage agent may be a detergent, such as Empigen-BB ( 
which is a mixture containing N-Docecyl-N,N-dimethylglycine as a major component), preferably at a concentration of 1 
to 10%, more preferably at a concentration of 3.5%. Mixtures of detergents, disulphide bond cleavage agents and/or 
1 5 reducing agents may also be used. In one embodiment, disulphide bond reformation is prevented with an SH group 

blocking agent, such as N-ethylmaleimide (NEM) or a derivative thereof. In a preferred embodiment, the disulphide bond 
reformation is blocked by use of low pH conditions. 

The present invention further provides a method as described herein, further involving the following steps: 
lysing recombinant El and/or E2 and/or E1/E2 expressing host cells, optionally in the presence of an SH blocking agent 
2 0 such as N-ethylmaleimide (NEM); recovering said HCV envelope proteins by affinity purification such as by means of 
lectin-chromatography, such as lentil-lectin chromatography, or by means of immunaaffinity using anti-E1 and/or anti-E2 
specific monoclonal antibodies; reducing or cleaving of the disulfide bonds with a disulphide bond cleaving agent, such 
as DTT, preferably also in the presence of an SH blocking agent such as NEM or Biotin-NEM; and, recovering the 
reduced El and/or E2 and/or E1/E2 envelope proteins by gelfiltration and optionally additionally by a subsequent Ni- 

2 5 IMAC chromatography and desalting step. 

The present invention provides a composition containing substantially isolated and/or purified, and/or isolated 
and/or purified recombinant HCV single or specific oligomeric recombinant envelope proteins selected from E1 and/or E2 
and/or E1/E2, which have preferably been isolated from the methods described herein. In a preferred embodiment, the 
recombinant HCV envelope proteins of the invention have been expressed in recombinant mammalian cells, such as 

3 0 vaccinia, recombinant yeast cells. 

The present invention provides a recombinant vector containing a vector sequence, a prokaryotic, eukaryorjc 
or viral promoter sequence and a nucleotide sequence allowing the expression of a single or specific oljgomeric El 
and/or E2 and/or E1/E2 protein, in operable combination. In one embodiment, the nucleotide sequence of the vector 
encodes a single HCV Et protein starting in the region between amino acid positions 1 and 192 and ending in the region 
3 5 between amino acid positions 250 and 400, more particularly ending in the region between positions 250 and 341 . even 
more preferably ending in the region between position 290 and 341. in another embodiment the nucleotide sequence of 
the vector encodes a single HCV El protein starting in the region between amino acid positions 1 1 7 and 192 and ending 
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in the region between amino add positions 263 and 400, more particularly ending in the region between positions 250 
and 326. In yet another embodiment, the nucleotide sequence of the vector encodes a single HCV El protein bearing a 
deletion of the first hydrophobic domain between positions 264 to 293, plus or minus 8 amino acids. In a further 
embodiment, the nucleotide sequence of the vector encodes a single HCV E2 protein starting in the region between 
5 amino acid positions 290 and 406 and ending in the region between amino acid positions 600 and 820, more particularly 
starting in the region between positions 322 and 406, even more preferably starting in the region between position 347 
and 406 and most preferably starting in the region between positions 364 and 406; and preferably ending at any of 
amino acid positions 623, 650, 661 , 673, 710, 715, 720, 746 or 809. The vector of the present invention, in one 
embodiment contains a oMerminal ATG codon and a 3'-terminal stop codon operably linked to the nucleotide sequence. 
10 The vector further contains, in one embodiment, a nucleotide sequence further containing at a factor Xa cleavage site 
and/or 3 to 10, preferably 6, histidine codons added 3'-terminally to the coding region. The vector of the present 
invention optionally contains a nucleotide sequence wherein at least one of the glycosylation sites present in the E1 or 
E2 proteins has been removed at the nucleic acid level. 

The present invention provides a nucleic acid containing any one of SEQ ID NOs: 3, 5, 7, 9, 1 1 , 13, 21, 23, 25, 
1 5 27, 29, 31 , 35, 37, 39. 41, 43, 45, 47 and 49, or parts thereof. The vector of the invention may preferably contain a 

nucleotide sequence containing a nucleic acid containing any one of SEQ ID NOs: 3, 5, 7, 9, 1 1 , 1 3, 21 , 23, 25, 27, 29, 
31 , 35, 37, 39, 41 , 43, 45. 47 and 49, or parts thereof. 

The composition of the present invention further contains recombinant HCV envelope proteins which have 
been expressed or are the expression product of a vector described herein. 

2 0 The present invention provides a host cell transformed with at least one recombinant vector as described 

herein, wherein the vector contains a nucleotide sequence encoding HCV El and/or E2 and/or E1/E2 protein as 
described herein in addition to a regulatory sequence operable in the host cell and capable of regulating expression of 
the HCV E1 and/or E2 and/or E1/E2 protein. Moreover, the present invention provides a ecombinant El and/or E2 
and/or E1/E2 protein expressed by a host cell of the invention. 
25 The present invention further provides a method as described herein and containing the following steps: 

growing a host cell as described herein which has been transformed with a recombinant vector as described herein in a 
suitable culture medium; causing expression of the vector nucleotide sequence of the vector, as described herein under 
suitable conditions; lysing the transformed host ceils, preferably in the presence of an SH group blocking agent, such as 
N-ethylmaleimide (NEM); recovering the HCV envelope protein by affinity purification by means of for instance lectin- 

3 0 chromatography or immunoafflnity chromatography using anti-El and/or anrj-E2 specific monoclonal antibodies, with 

said lectin being preferably lentiMectin. followed by, incubation of the eluate of the previous step with a disulphide bond 
cleavage agent, such as DTT, preferably also in the presence of an SH group blocking agent, such as NEM or Biotin- 
NEM; and, isolating the HCV single or specific oligomeric El and/or E2 and/or E1/E2 proteins by means of gelfiltration 
and possibly also by means of an additional NPMMAC chromatography and desalting step. 
35 The present invention provides a composition containing at least one of the following E1 and/or E2 peptides: 

E1-31 (SEQ ID NO 56) spanning amino acids 1 81 to 200 of the Core/E! VI region, 
E1-33 (SEQ ID NO 57) spanning amino acids 193 to 212 of the E1 region, 
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El -35 (SEQ ID NO 58) spanning amino adds 205 to 224 of the El V2 region (epitope B), 
E1-35A (SEQ ID NO 59) spanning amino acids 208 to 227 of the El V2 region (epitope B), 
tbEl (SEQ ID NO 53) spanning amino acids 192 to 228 of El regions (V1 , C1, and V2 regions 
(containing epitope B), 

5 El-51 (SEQ ID NO 66) spanning amino acids 301 to 320 of the El region, 

El -S3 (SEQ ID NO 67) spanning amino acids 313 to 332 of the Et C4 region (epitope A), 

E1-55 (SEQ ID NO 68) spanning amino acids 325 to 344 of the Et region. 

Env 67 or E2-67 (SEQ ID MO 72) spanning amino acid positions 397 to 416 of the E2 region (epitope 

A), 

10 Env 69 or E2-69 (SEQ ID NO 73) spanning amino add positions 409 to 428 of the E2 region (epitope 

A), 

Env 23 or 62-23 (SEQ ID NO 86) spanning positions 583 to 602 of the E2 region (epitope E), 
Env 25 or E2-25 (SEQ ID MO 87) spanning positions 595 to 61 4 of the E2 region (epitope E), 
Env 27 or E2-27 (SEQ ID NO 38) spanning positions 607 to 626 of the E2 region' (epitope E), 
1 5 Env 17B or E2-17B (SEQ ID NO 83) spanning positions 547 to 566 of the E2 region (epitope D), 

Env 1 38 or E2-1 3B (SEQ ID NO 32) spanning positions 523 to 542 of the E2 region (epitope C). 
The present invention provides a composition containing at least one of the following E2 conformational 

epitopes: 

epitope F recognized by monoclonal antibodies 15C8C1, 12D1 1 Ft, and 8G10D1H9, 
2 0 epitope G recognized by monoclonal antibody 9G3E6 ( 

epitope H (or C) recognized by monoclonal antibodies 10D3C4 and 4H6B2, 
epitope I recognized by monoclonal antibody 1 7F2C2. 

The present invention provides an El and/or E2 specific monoclonal antibody raised upon immunization with a 
composition as described herein. The antibodies of the present invention may be used, for example, as a medicament, 

2 5 for incorporation into an immunoassay kit for detecting the presence of HCV El or E2 antigen, for prognosis/monitoring 

of disease or for HCV therapy. The present invention provides for the use of an E1 and/or E2 specific monoclonal 
antibody as described herein tor the preparation of an immunoassay kit for detecting HCV £1 or E2 antigens, for the 
preparation of a kit for prognosing/monitoring of HCV disease or for the preparation of a HCV medicament 

The present invention provides a method for in vitro diagnosis of HCV antigen present in a biological sample, 

3 0 containing at least the following steps: 

(i) contacting said biological sample with an El and/or E2 specific monoclonal antibody as described herein, 
preferably in an immobilized form under appropriate conditions which allow the formation of an immune complex, 

(ii) removing unbound components, 

(iii) incubating the immune complexes formed with heterologous antibodies, with the heterologous antibodies 
3 5 being conjugated to a detectable label under appropriate conditions, 

(iv) detecting the presence of the immune complexes visually or mechanically. 

The present invention provides a kit for determining the presence of HCV antigens present in a biological 
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sample, which includes at least the following: at least one Et and/or E2 specific monoclonal antibody as described 
herein, preferably in an immobilized form on a solid substrate, a buffer or components necessary for producing the buffer 
enabling binding reaction between these antibdodies and the HCV antigens present in a biological sample, and 
optionally a means for detecting the immune complexes formed in the preceding binding reaction. 
5 The composition of the present invention may be provided in the form of a medicament 

The present invention provides a composition, as described herein for use as a vaccine for immunizing a 
mammal, preferably humans, against HCV, comprising administrating an effective amount of said composition being 
optionally accompanied by pharmaceutically acceptable adjuvants, to produce an immune response. 

The present Invention provides a method of using the composition, as described herein, for the preparation of 
10 a vaccine for immunizing a mammal, preferably humans, against HCV, comprising administrating an effective amount of 
said composition, optionally accompanied by pharmaceutically acceptable adjuvants, to produce an immune response. 

The present invention provides a vaccine composition for immunzing a mammal, preferably humans, against 
HCV, which contains an effective amount of a composition containing an El and/or E2 containing composition as 
described herein, optionally also accompanied by pharmaceutically acceptable adjuvants. 
15 The composition of the present invention may be provided in a form for in vitro detection of HCV antibodies 

present in a biological sample. The present invention also provides a method of preparing an immunoassay kit for 
detecting HCV antibodies present in a biological sample and a method of detecting HCV antibodies present in a 
biological sample using the kit of the invention to diagnose HCV antibodies present in a biological sampie. Such a 
method of the present invention includes at least the following steps: 
20 (i) contacting said biological sample with a composition as described herein, preferably in an immobilized form 

under appropriate conditions which allow the formation of an immune complex with HCV antibodies present in the 
biological sample, 

(ii) removing unbound components, 

(iii) incubating the immune complexes formed with heterologous antibodies, with the heterologous antibodies 

2 5 being conjugated to a detectable label under appropriate conditions, 

(iv) detecting the presence of the immune complexes visually or mechanically. 

The present invention provides a kit for determining the presence of HCV antibodies present in a biological 
sample, containing: at least one peptide or protein composition as described herein, preferably in an immobilized form 
on a solid substrate; a buffer or components necessary for producing the buffer enabling binding reaction between these 

3 0 proteins or peptides and the antibodies against HCV present in the biological sample; and, optionally, a means for 

detecting the immune complexes formed in the preceding binding reaction. 

The present invention provides a method of in vitro monitoring HCV disease or diagnosing the response of a 
patientsuffering from HCV infection to treatment , preferably with interferon, the method including: incubating a biological 
sample from the patient with HCV infection with an El protein or a suitable part thereof under conditions allowing the 
3 5 formation of an immunological complex; removing unbound components; calculating the anti-E1 titers present in the 
sample at the start of and during the course of treatment; monitoring the natural course of HCV disease, or diagnosing 
the response to treatment of the patient on the basis of the amount anti-El titers found in the sample at the start of 
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treatment and/or during the course of treatment. 

The present invention provides a kit for monitoring HCV disease or prognosing the response to treatment 
particularly with interferon, of patients suffering from HCV infection, wherein the kit contains: at least one £1 protein or 
E1 peptide, more particularly an El protein or Et peptide as described herein; a buffer or components necessary (or 
5 producing the buffer enabling the binding reaction between these proteins or peptides and the anti-El antibodies present 
in a biological sample; and optionally, means for detetecting the immune complexes formed in the preceding binding 
reaction, optionally, also an automated scanning and interpretation device for inferring a decrease of anti-El titers during 
the progression of treatment 

The present invention provides a serotyping assay for detecting one or more serological types of HCV present 
10 in a biological sample, more particularly for detecting antibodies of the different types of HCV to be detected combined in 
one assay format, including at least the following steps : (i) contacting the biological sample to be analyzed for the 
presence of HCV antibodies of one or more serological types, with at least one of the El and/or E2 and/or E1/E2 protein 
compositions as described herein or at least one of the Et or E2 peptide compositions described herein, preferentially in 
an immobilized form under appropriate conditions which allow the formation of an immune complex; (ii) removing 
1 5 unbound components; (iii) incubating the immune complexes formed with heterologous antibodies, with the 

heterologous antibodies being conjugated to a detectable label under appropriate conditions; and optionally, (iv) 
detecting the presence of said immune complexes visually or mechanically (e.g. by means of densitometry, fluorimetry, 
colorimetry) and inferring the presence of one or more HCV serological types present from the observed binding pattern. 

The present invention provides a kit for serotyping one or more serological types of HCV present in a biological 

2 0 sample, more particularly for detecting the antibodies to these serological types of HCV containing: at least one E1 

and/or E2 and/or E1/E2 protein as described herein or an Et or E2 peptide as described herein; a buffer or components 
necessary for producing the buffer enabling the binding reaction between these proteins or peptides and the anti-El 
antibodies present in a biological sample;; optionally, means for detecting the immune complexes formed in the 
preceding binding reaction, optionally, also an automated scanning and interpretation device for detecting the presence 
25 of one or more serological types present from the observed binding pattern. 

The- present invention provides a peptide or protein composition as described herein, for immobilization on a 
solid substrate and incorporation into a reversed phase hybridization assay, preferably for immobilization as parallel lines 
onto a solid support such as a membrane strip, for determining the presence or the genotype of HCV according to a 
method as described herein. 

30 The present invention provides a therapeutic vaccine composition containing a therapeutic effective amount of: 

a composition containing at least one purified recombinant HCV single or specific oligomeric recombinant 
envelope proteins selected from the group of an Et protein and an E2 protein; and optionally a pharmaceutical^ 
acceptable adjuvant. The HCV envelope proteins of the vaccine of the present invention are optionally produced by 
recombinant mammalian cells or recombinant yeast cells. The invention provides a therapeutic vaccine composition 

3 5 containing a therapeutically effective amount of a composition containing at least one of the following Et and E2 

peptides: 

E1-31 (SEQ ID NO:56) spanning amino acids 181 to 200 of the Core/E1 Vt region. 
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Et-33 (SEQ ID NO:57) spanning amino acids 193 to 212 of the El region, 
E1-35 (SEQ ID NO:58) spanning amino acids 205 to 224 of the El V2 region (epitope B), 
E1-35A (SEQ ID NO:59) spanning amino acids 208 to 227 of the El V2 region (epitope B), 
1bE1 (SEQIDNO:53) spanning amino acids 192 to 228 of El regions VI, CI, and V2 regions (containing 
5 epitope B), 

El-51 (SEQ ID NO:66) spanning amino acids 301 to 320 of the El region, 

E1-53 (SEQ ID NO:67) spanning amino acids 313 to 332 of the E1 C4 region (epitope A), 

E1-55 (SEQ ID NO:68) spanning amino acids 325 to 344 of the El region, 

Env 67 or E2-67 (SEQ ID NO:72) spanning amino add positions 397 to 41 8 of the E2 region (epitope A), 
10 Env 69 or E2-69 (SEQ ID NO:73) spanning amino acid positions 409 to 428 of the E2 region (epitope A), 

Env 23 or E2-23 (SEQ ID NO:86) spanning positions 583 to 602 of the E2 region (epitope E), 
Env 25 or E2-25 (SEQ ID NO:87) spanning positions 595 to 614 of the E2 region (epitope E), 
Env 27 or E2-27 (SEQ ID NO:88) spanning positions 507 to 626 of the E2 region (epitope E), 
Env 178 or E2-178 (SEQ ID NO:83) spanning positions 547 to 586 of the E2 region (epitope D), and 
15 Env 13B or E2-13B (SEQ ID NO:82) spanning positions 523 to 542 of the E2 region (epitope C). 

The present invention provides a method of treating a mammal, such as a human, infected with HCV 
comprising administering an effective amount of a composition as described herein, such as the above described 
vaccines, and optionally, a pharmaceutical^ acceptaole adjuvant In one emcodiment, the composition of the invention 
is administered in combination with or at a time in conjunction with antiviral therapy, either soon prior to or subsequent to 
20 or with administration of the composition of the invention. 

The present invention provides a composition containing at least one purified recombinant HCV recombinant 
envelope proteins selected from the group of an El protein and an E2 protein, and optionally an adjuvant In a preferred 
embodiment, the composition contains at least one of the following El and E2 peptides: 

E1-31 (SEQ ID NO:56) spanning amino acids 131 to 200 of the Core/El VI region, 
25 E1 -33 (SEQ ID NO:57) spanning amino acids 1 93 to 21 2 of the El region, 

E1 -35 (SEQ ID NO:58) spanning amino acids 205 to 224 of the El V2 region (epitope B), 
E1-35A (SEQ ID NO:59) spanning amino acids 208-to 227 of the E1 V2 region (epitope B), 
IbEl (SEQ ID NO:53) spanning amino acids 192 to 223 of E1 regions V1, CI , and V2 regions (containing 
epitope B), 

3 0 E1 -5 1 (SEQ ID NO:66) spanning amino acids 301 to 320 of the E1 region, 

El -53 (SEQ ID NO:67) spanning amino acids 3 1 3 to 332 of the E1 C4 region (epitope A), 
El -55 (SEQ ID NO:68) spanning amino acids 325 to 344 of the E1 region, 

Env 67 or E2-67 (SEQ ID NO:72) spanning amino acid positions 397 to 418 of the E2 region (epitope A), 
Env 69 or E2-69 (SEQ ID NO:73) spanning amino acid positions 409 to 428 of the E2 region (epitope A), 
3 5 Env 23 or E2-23 (SEQ ID NO:86) spanning positions 583 to 602 of the E2 region (epitope E), 

Env 25 or E2-25 (SEQ ID NO:87) spanning positions 595 to 61 4 of the E2 region (epitope E), 
Env 27 or E2-27 (SEQ ID NO:88) spanning positions 607 to 626 of the E2 region (epitope E). 
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Env 1 78 or E2-1 78 (SEQ ID NO:32) spanning positions 547 to 586 of the E2 region (epitope D), and 
Env 13B or E2-13B (SEQ ID NO:82) spanning positions 523 to 542 of the E2 region (epitope C). 
The present invention provides a therapeutic composition for inducing HCV-specific antibodies containing a 
therapeutic effective amount of a composition containing an E1/E2 complex formed from purified recombinant HCV 
5 single or specific oiigomeric recombinant Et or E2 proteins; and optionally a pharmaceutically acceptable adjuvant The 
recombinant HCV envelope proteins of the invention may be produced by recombinant mammalian cells or recombinant 
HCV envelope proteins are produced by recombinant yeast cells. The present invention provides a method of treating a 
mammal, such as a human, infected with HCV including administering an effective amount of a composition as described 
herein and, optionally, a pharmaceutical acceptable adjuvant The present invention provides a therapeutic 
1 0 composition for inducing HCV-specific antibodies containing a therapeutic effective amount of a composition containing 
at least one purified recombinant HCV single or specific oiigomeric recombinant envelope protein selected from the 
group of an E1 protein and an E2 protein; and optionally a pharmaceutically acceptable adjuvant 
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Figure and Table legends 

Figure 1 : Restriction map of plasmid pgpt ATA 1 8 

Figure 2 : Restriction map of plasmid pgs ATA 1 8 
5 Figure 3: Restriction map of plasmid pMS 65 

Figure 4 : Restriction map of plasmid pv HCV-1 1 A 

Figure 5 : Anti-El levels in non-responders to IFN treatment 

Figure 6 : Anti-E1 levels in responders to IFN treatment 

Figure 7 : Anti-E1 levels in patients with complete response to IFN treatment 
1 0 Figure 8 : Anti-E1 levels in incomplete responders to IFN treatment 

Figure 9 : Anti-E2 levels in non-responders to IFN treatment 

Figure 10 : Anti-E2 levels in responders to IFN treatment 

Figure 1 1 : Anti-E2 levels in incomplete responders to IFN treatment 

Figure 12 : Anti-E2 levels in complete responders to IFN treatment 
1 5 Figure 1 3 : Human anti-E1 reactivity competed with peptides 

Figure 1 4 : Competition of reactivity of anti-E1 monoclonal antibodies with peptides 

Figure 1 5 : Anti-E 1 (epitope 1 ) levels in non-responders to IFN treatment 

Figure 16 : Anti-El (epitope 1) levels in responders to IFN treatment 

Figure 1 7 : Anti-E1 (epitope 2) levels in non-responders to IFN treatment 
2 0 Figure 18 : Anti-El (epitope 2) levels in responders to IFN treatment 

Figure 19 : Competition of reactivity of anti-E2 monoclonal antibodies with peptides 

Figure 20: Human anti-E2 reactivity competed with peptides 

Figure 21 : Nudeic acid sequences of the present invention. The nucleic acid sequences encoding an E1 
or E2 protein according to the present invention may be translated (SEQ ID NO 3 to 13 T 21- 
25 31, 35 and 41-49 are translated in a reading frame starting from residue number 1, SEQ ID 

NO 37-39 are translated in a reading frame starting from residue number 2), into the amino 
acid sequences of the respective E1 or E2 proteins as shown in the sequence listing. 

Figure 22: EUSA results obtained from lentil lectin chromatography eluate fractions of 4 different E1 
purifications of cell lysates infected with wHCV39 (type 1b), wHCV40 (type 1b), wHCV62 
. 3 0 (type 3a), and wHCV63 (type 5a). 

Figure 23: Elution profiles obtained from the lentil lectin chromatography of the 4 different E1 constructs 
on the basis of the values as shown in Figure 22. 

Figure 24: EUSA results obtained from fractions obtained after gelfiltration chromatography of 4 different 
E1 purifications of cell lysates infected with wHCV39 (type 1b), wHCV40 (type 1b), wHCV62 
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(type 3a), and wHCV63 (type 5a). 
Figure 25; Profiles obtained from purifications of £1 proteins of type 1b (1), type 3a (2), and type 5a (3) 
(from RK13 cells infected with wHCV39. wHCV62, and wHCV63, respectively; purified on 
lentil lectin and reduced as in example 52 - 5.3) and a standard (4). The peaks indicated with 
5 1 S', and "J, represent pure El protein peaks (see Figure 24, E1 reactivity mainly in fractions 

26 to 30). 

Figure 26: Silver staining of an SDS-PAGE as described in example 4 of a raw lysate of E1 wHCV40 
(type 1b) (lane 1), pool 1 of the gelfiltration of wHCV40 representing fractions 10 to 17 as 
shown in Figure 25 (lane 2), pool 2 of the geifiltration of wHCV40 representing fractions 18 to 

10 25 as shown in Figure 25 (lane 3), and El pool (fractions 26 to 30) (lane 4). 

Figure 27: Streptavidine-alkaline phosphatase blot of the fractions of the gelfiltration of E1 constructs 39 
(type 1b) and 62 (type 3a). The proteins were labelled with NEM-bioSn. Lane 1: start 
gelfiltration construct 39, lane 2: fraction 26 construct 39, lane 3: fraction 27 construct 39, lane 
4: fraction 28 construct 39, lane 5: fraction 29 construct 39, lane 6: fraction 30 construct 39, 

15 lane 7 fraction 31 construct 39, lane 8: molecular weight marker, lane 9: start gelfiltration 

construct 62, lane 10: fraction 26 construct 62, lane 11: fraction 27 construct 62. lane 12: 
fraction 28 construct 62 ? iane :3: fraction 29 construct 62, lane 14: fraction 30 construct 62, 
lane 15: fraction 31 construct 62. 
Figure 28: Siver staining of an SDS-PAGE gel of the gelfiltration fractions of wHCV-39 (E1s, type 1b) 

20 and wHCV-62 (E1s, type 3a) run under identical conditions as Figure 26. Lane 1: start 

gelfiltration construct 39, lane 2: fraction 26 construct 39, lane 3: fraction 27 construct 39, lane 
4: fraction 28 construct 39, lane 5: fraction 29 construct 39, lane 6: fraction 30 construct 39, 
lane 7 fraction 31 construct 39, iane 8: molecular weight marker, lane 9: start gelfiltration 
construct 62, iane 10: fraction 26 construct 62, lane 11: fraction 27 construct 62, lane 12: 

2 5 fraction 28 construct 62, lane 13: fraction 29 construct 62, lane 14: fraction 30 construct 62, 

lane 15: fraction 31 construct 62. 
Figure 29: Western Blot analysis with anti-E1 mouse monoclonal antibody 5E1A10 giving a complete 
overview of the purification procedure. Lane 1: crude lysate, Lane "2: flow through of lentil 
chromagtography, Lane 3: wash with Empigen B8 after lentil chromatography, Lane 4: Eluate 

30 , of lentil chromatography, Lane 5: Flow through during concentration of the lentil eluate, Lane 

6: Pool of E1 after Size Exclusion Chromatography (gelfiltration). 
Figure 30: OOaw profile (continuous line) of the lentil lectin chromatography of E2 protein from RK1 3 cells 
infected with WHCV44. The dotted line represents the E2 reactivity as detected by ELISA (as 
in example 6). 
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Figure 31 A: ODan profile (continuous line) of the lentil-lectin gelfiltration chromatography E2 protein pool 
from RK13 cells infected with wHCV44 in which the E2 pool is applied immediately on the 
gelfiltration column (non-reduced conditions). The dotted line represents the E2 reactivity as 
detected by ELISA (as in example S). 
5 Figure 31 B: ODsbo profile (continuous line) of the lentil-lectin gelfiltration chromatography E2 protein pool 
from RK13 ceils infected with wh'CV44 in which the E2 pool was reduced and blocked 
according to Example 5.3 (reduced conditions). The dotted line represents the E2 reactivity as 
detected by EUSA (as in example 6). 

Figure 32: NP + -IMAC chromatography and EUSA reactivity of the E2 protein as expressed from 

1 o wHCV44 after gelfiltration under reducing conditions as shown in Figure 31 B. 

Figure 33: Silver staining of an SDS-PAGE of 0.5 ug of purified E2 protein recovered by a 200 mM 
imidazole elution step (lane 2) and a 30mM imidazole wash (lane 1) of the NiNMAC 
chromatography as shown in Figure 32. 

Figure 34: OD profiles of a desalting step of the purified E2 protein recovered by 200 mM immidazole as 
1 5 shown in Figure 33, intended to remove imidazole. 

Figure 35A: Antibody levels to the different HCV antigens (Core 1, Core 2, E2HCVR, NS3) for NR and 
LTR followed during treatment and over a period of 5 to 12 monihs after treatment determined 
by means of the UAscan method. The average values are indicated by the curves with the 
open squares. 

2 0 Figure 35B: Antibody levels to the different HCV antigens (NS4, NS5, E1 and E2) for NR and LTR followed 

during treatment and over a period of 6 to 12 months after treatment determined by means of 
the UAscan method. The avergae vallues are indicated by the curve with the open squares. 



Figure 36: Average E1 antibody (E1 Ab) and E2 antibody (E2Ab) levels in the LTR and NR groups. 

2 5 Figure 37: Averages E1 antibody (E1 Ab) levels for non-responders (NR) and long term responders (LTR) 

for type 1b and type 3a. 
Figure 38: Relative map positions of the anti-E2 monoclonal antibodies. 

Figure 39: Partial deglycosylation of HCV E1 envelope protein. The lysate of wHCVIOA-infected RK13 
cells were incubated with different concentrations of glycosidases according to the 

3 0 manufacturer's instructions. Right panel: Glycopeptidase F (PNGase F). Left panel: 

Endoglycosidase H (Endo H). 
Figure 40: Partial deglycosylation of HCV E2 envelope proteins. The lysate of wHCV64-infected (E2) 
and WHCV41 -infected (E2s)RK13 cells were incubated with different concentrations of 
Glycopeptidase F (PNGase F) according to the manufacturer's instructions. 
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Figure 41: In vitro mutagenesis of HCV Et glycoproteins. Map of the mutated sequences and the 
creation of new restriction sites. 
' Figure 42A: In vitro mutagenesis of HCV E1 glycoprotein (part 1 ). First step of PCR amplification. 
Figure 42B: In vitro mutagensis of HCV E1 glycoprotein (part 2). Overlap extension and nested PCR. 
5 Figure 43: In vitro mutagesesis of HCV E1 glycoproteins. Map of the PCR mutated fragments (GLY4 
and OVR-#) synthesized during the first step of amplification. 
Figure 44A: Analysis of E1 glycoprotein mutants by Western blot expressed in HeLa (left) and RK1 3 (right) 
cells. Lane 1: wild type W (vaccinia virus), Lane 2: original E1 protein (wHCV-10A), Lane 3: 
El mutant GIy-1 (wHCV-81), Lane 4: E1 mutant Gly-2 (wHCV-82), Lane 5: El mutant Gly-3 
10 (wHCV-83), Lane 6: E1 mutant Gly-4 (wHCV-84), Lane 7: E1 mutant Gly-5 (wHCV-85), 

Lane 8: E1 mutant Gly-6 (wHCV-86). 
Figure 448: Analysis of E1 glycosylate mutant vaccinia viruses by PCR amplification/restriction. Lane 1: 
E1 (WHCV-10A), BspE I, lane 2: E1.GLY-1 (wHCV-31), BspE I Lane 4: E1 (wHCV-10A), 
Sac /, Lane 5: E1 .GLY-2 (wHCV-82), Sac /, Lane 7: E1 (wHCV-1 OA), Sac I, Lane 8: E1 .GLY- 
15 3 (wHCV-83), Sac!, Lane 10: E1 (wHCV-10A), Stu /, Lane 11: E1 .GLY-4 (wHCV-84), Stu /, 

Lane 13: E1 (wHCV-10A), Sma /, Lane 14: E1 .GLY-5 (wHCV-85), Sma I Lane 16: E1 
(WHCV-10A), Stu f, Lane 17: E1.GLY-6 (wHCV-86). Stu /, Lane 3 - 6 - 9 - 12 - 15 : Low 
Molecular Weight Marker, pBluescript SKr, Msp I. 
Figure 45: SOS polyacrylamide gel electrophoresis of recombinant E2 expressed in S. cerevisiae . 
2 0 Innoculates were grown in leucine selective medium for 72 hrs. and diluted 1/15 in complete 

medium. After 10 days of culture at 28°C, medium samples were taken. The equivalent of 
200 jil of culture supernatant concentrated by speedvac was loaded on the gel. Two 
independent transformants were analysed. 
Figure 46: SOS polyacrylamide gel electrophoresis of recombinant E2 expressed in a glycosylation 

2 5 deficient S. cerevisiae mutant. Innoculae were grown in leucine selective medium for 72 hrs. 

and diluted 1/15 in complete medium. After 10 days of culture at 28°C, medium samples were 
taken. The equivalent of 350 pi of culture supernatant, concentrated by ion exchange 
chromatography, was loaded on the gel. 
Figure 47: Profile of chimpanzees and immunization schedule. 

3 0 Figure 48: Cellular response after 3 immunizations. 

Figure 49: Evolution of cellular response upon repeated E1 immunizations. 
Figure 50: Cellular response upon NS3 immunizations. 

Table 1 : Features of the respective clones and primers used for amplification for constructing the 

different forms of the E1 protein as despected in Example 1. 
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Table 2 : Summary of Anti-E1 tests 

Table 3 : Synthetic peptides for competition studies 

Table 4: Changes of envelope antibody levels over time. 

Table 5: Difference between LTR and NR 

5 Table 6: Competition experiments between murine E2 monoclonal antibodies 

Table 7: Primers for construction of E 1 glycosylate mutants 

Table 8: Analysis of E1 glycosylate mutants by BJSA 

Table 9: Profile of adjuvanted E1 8alh/c mice. 

Table 1 0: Humoral responses: No. of immunizations required for different E1 -antibodies levels. 
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Example 1: Cloning and expression of the hepatitis C virus E1 protein 

1 . Construction of vaccinia virus recombination vectors 

5 The pgptATA16 vaccinia recombination plasmid is a modified version of pATA18 (Siunnenberg et al, 

1 988) with an additional insertion containing the Ecoli xanthine guanine phosphoribosyi transferase gene under 
the control of the vaccinia virus 13 intermediate promoter (Figure 1 ). The plasmid pgsATA18 was constructed by 
inserting an oligonucleotide linker with SEQ ID M0 1/94, containing stop codons in the three reading frames, Into 
the Pst I and Hindlll-cut pATA1 8 vector This created an extra Pac I restriction site (Figure 2). The original Hindill 
10 site was not restored. 

Oligonucleotide linker with SEQ ID N0 1/94: 

5 ' G GCATGC .AAGCTT AATTAATT 3 * 

3 1 ACGTC CGTACG TTCGAA TTAATTAA TCGA 5 ' 

15 

Pst I SphI Hindill Pac I (Hindill) 

In order to facilitate rapid and efficient purification, by means of NP* chelation of engineered histidine . 
stretches fused to the recombinant proteins, the vaccinia recombination vector pMS66 was designed to. express - 

20 secreted proteins with an additional carboxy-terminal' histidine tag. An. oligonucfeotida- linker, with SEQ ID NO ; 
2/95, containing unique sites for 3 restriction enzymes generating blunt ends' (Sma I, Stu I and Pml 1/BbrPQ was. 
synthesized in such a way that the carboxy-terminal end of any. cDNA could.be inserted in frame with' a 
sequence encoding the protease factor Xa cleavage site followed by a nucleotide sequence encoding 6 
histidihes and 2 stop codons (a new Pac I restriction site was also created downstream the 3'end). This 

25 oligonucleotide with SEQ ID NO 2/95 was introduced between the Xma I and Pst I sites of pgptATAta (Figure . 
3)- 

Oligonucleotide linker with SEQ ID NO 295: 

'5' CCGGG GAGGCCTGCACGTGATCG AGGGCAGACACGATCACC AC C ATC ACT AATAGTTAATT AA CTGCA 
3' 

30 3' C CTCC GGAC GTGC AC T AGCTCCCGTCTGTGGTAGTGGTGGT A-GTGATTATCAATT AATT G 

Xmal 

Plasmid pgptATA-18 contained within Escherichia coli MC1 061 (lambda) has been 
deposited under the terms of the Budapest Treaty at BCCM/LMBP (Belgian Coordinated 
35 Collections of microorganisms/Laboratorium voor Moleculaire Biologie 

Plasmidencollectie, Universiteit Gent, K.L Ledeganckstraat 35, B-9000 Ghent, Belgium), 
and bears accession number LMBP4486. Said deposit was made on January 9, 2002. 

Example 2. Construction of HCV recombinant plasmtds 
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2.1 . Constructs encoding different forms of the E1 protein 

Polymerase Chain Reaction (PGR) products were derived from the serum samples by RNA preparation 
and subsequent reverse-transcription and PCR as described previously (Stuyver et ai. t 1993b). Table 1 shows 
5 the features of the respective clones and the primers used for amplification. The PCR fragments were cloned 
into the Sma kut pSP72 (Promega) plasmids. The following clones were selected for insertion into vaccinia 
reombination vectors: HCC19A (SEQ ID NO 3), HCCI10A (SEQ ID NO 5), HCCI11A (SEQ ID NO 7), HCCI12A 
(SEQ ID NO 9), HCCI13A (SEQ ID NO 11), and HCCI17A (SEQ ID NO 13) as depicted in Figure 21. cDNA 
fragments containing the E1 -coding regions were cleaved by EcoRI and Hindlll restriction from the respective 
1 0 pSP72 plasmids and inserted into the EcoRl/Hindlll-cut pgptATA-18 vaccinia recombination vector (described in 
example 1), downstream of the 11K vaccinia virus late promoter. The respective plasmids were designated 
pvHCV-9A t pvHCV-10A, pvHCV-1 1 A, pvHCV-12A, pvHCV-13A and pvHCV-17A, of which pvHCV-1 1 A is shown 
in Figure 4. 

15 2.2. Hydrophobic region E1 deletion mutants 

Clone HCCI37, containing a deletion of codons Asp264 to Vai287 (nucleotides 790 to 861, region 
encoding hydrophobic domain I) was generated as follows: 2 PCR fragments were generated from clone 
HCCI10A with primer sets HCPr52 (SEQ ID NO 16)/HCPr107 (SEQ ID NO 19) and HCPr108 (SEQ ID NO 
20 20J/HCPR54 (SEQ ID NO 18). These primers are shown in Figure 21. The two PCR fragments were purified 
from agarose gel after electrophoresis and 1 ng of each fragment was used together as template for PCR by 
means of primers HCPr52 (SEQ ID NO 16) and HCPr54 (SEQ ID NO 18). The resulting fragment was cloned 
into the Sma kut pSP72 vector and clones containing the deletion were readily identified because of the 
deletion of 24 codons (72 base pairs). Plasmid pSP72HCCt37 containing clone HCC137 (SEQ ID 15) was 

2 5 selected. A recombinant vaccinia plasmid containing the full-length E1 cDNA lacking hydrophobic domain I was 

constructed by inserting the HCV sequence surrounding the deletion (fragment cleaved by Xma I and BamH I 
from the vector pSP72-HCCI37) into the Xma I-Bam H I sites of the vaccinia plasmid pvHCV-10A. The resulting 
plasmid was named pvHCV-37. After confirmatory sequencing, the amino-terminal region containing the internal 
deletion was isolated from this vector pvHCV-37 (cleavage by EcoR I and BstE II) and reinserted into the Eco Rl 

3 0 and Bst Ell-cut pvHCV-11A plasmid. This construct was expected to express an E1 protein with both 

hydrophobic domains deleted and was named pvHCV-38. The E1 -coding region of clone HCCI38 is represented 
by SEQ ID NO 23. 

As the hydrophilic region at the E1 carboxyterminus (theoretically extending to around amino acids 
337-340) was not completely included in construct pvHCV-38, a larger E1 region lacking hydrophobic domain I 
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was isolated from the pvHCV-37 plasmid by EcoR I/Bam HI cleavage and cloned into an EcoRI/BamHkut 
pgsATA-18 vector. The resulting plasmid was named pvHCV-39 and contained clone HCCI39 (SEQ ID NO 25). 
TTie same fragment was cleaved from the pvHCV-37 vector by BamH I (of which the sticky ends were filled with 
Klenow DNA Polymerase I (Boehringer)) and subsequently by EcoR I (5* cohesive end). This sequence was 
5 inserted into the EcoRI and Bbr Pkut vector pMS-66. This resulted in clone HCCI40 (SEQ ID NO 27) in plasmid 
pvHCV-40, containing a 6 histidine tail at its carboxy-terminal end 

2.3. E1 of other genotypes 

10 Clone HCCI62 (SEQ ID NO 29) was derived from a type 3a-infected patient with chronic hepatitis C 

(serum BR36, clone BR36-9-13, SEQ ID NO 19 in WO 94/25601, and see also Stuyver et al. 1993a) and 
HCCI63 (SEQ ID NO 31) was derived from a type 5a-inrected child with post-transfusion hepatitis (serum BE95, 
clone PC-4-1 , SEQ ID NO 45 in WO 94/25601). 

15 2.4. E2 constructs 

The HCV E2 PCR fragment 22 was obtained from serum BE11 (genotype 1b) by means of primers 
HCPr109 (SEQ ID NO 33) and HCPr72 (SEQ ID NO 34) using techniques of RNA preparation, reverse- 
transcription and PCR, as described in Stuyver et al., 1993b t and the fragment was cloned into the Sma kut 
2 0 pSP72 vector Clone HCCI22A (SEQ ID NO 35) was cut with Ncol/AIwNI or by BamHI/AlwNI and the sticky ends 
of the fragments were blunted (Ncol and BamHI sites with Klenow DNA Polymerase I (Boehringer), and AlwNI 
with T4 DNA polymerase (Boehringer)). The BamHI/AlwNI cDNA fragment was then inserted into the vaccinia 
pgsATA-1 8 vector that had been linearized by EcoR I and Hind HI cleavage and of which the cohesive ends had 
been filled with Klenow DNA Polymerase (Boehringer). The resulting plasmid was named pvHCV-41 and 

2 5 encoded the E2 region from amino acids Met347 to Gln673, including 37 amino acids (from Met347 to Gly383) 

of the E1 protein that can serve as signal sequence. The same HCV cDNA was inserted into the EcoR I and Bbr 
Pkut vector pMS66, that had subsequently been blunt ended with Klenow DNA Polymerase. The resulting 
plasmid was named pvHCV-42 and also encoded amino acids 347 to 683. The Ncol/AIwNI fragment was 
inserted in a similar way into the same sites of pgsATA-18 (pvHCV-43) or pMS-66 vaccinia vectors (pvHCV-44). 

3 0 pvHCV-43 and pvHCV-44 encoded amino acids 364 to 673 of the HCV polyprotein, of which amino acids 364 to 

383 were derived from the natural carboxyterminai region of the E1 protein encoding the signal sequence for E2 ; 
and amino acids 384 to 673 of the mature E2 protein. 

2.5. Generation of recombinant HCV-vaccinia viruses 
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Rabbit kidney RK13 cells (ATCC CCL 37), human osteosarcoma 143B thymidine kinase deficient (TIC) 
(ATCC CRL 8303), Hela (ATCC CCL 2), and Hep G2 (ATCC HB 8065) cell lines were obtained from the 
American Type Culture Collection (ATCC, Rockville, Md, USA). The cells were grown in Dulbecco's modified 
5 Eagle medium (DMEM) supplemented with 10 % foetal calf serum, and with Earie's salts (EMEM) for RK13 and 
143 B (TK-), and with glucose (4 g/I) for Hep G2. The vaccinia virus WR strain (Western Reserve, ATTC VR1 19) 
was routinely propagated in either 143B or RK13 cells, as described previously (Panicaii & Paoletti, 1982; Piccini 
et a)., 1987; Mackett et ah, 1982, 1984, and 1986). A confluent monolayer of 143B cells was infected with wild 
type vaccinia virus at a multiplicity of infection (m.o.i.) of 0.1 (= 0.1 plaque forming unit (PFU) per cell): Two 

10 hours later, the vaccinia recombination plasmid was transfected into the infected cells in the form of a calcium 
phosphate coprecipitate containing 500 ng of the plasmid DNA to allow homologous recombination (Graham & 
van der Eb, 1973; Mackett et al. t 1985). Recombinant viruses expressing the Escherichia coli xanthine-guanine 
phosphoribosyi transferase (gpt) protein were selected on rabbit kidney RK13 cells incubated in selection 
medium (EMEM containing 25 ug/ml mycophenolic acid (MPA), 250 ,ug/ml xanthine, and 15 ,ug/ml 

15 hypoxanthine; Falkner and Moss, 1988; Janknecht et al, 1991). Single recombinant viruses were purified on 
fresh monolayers of RK13 cells under a 0.9% agarose overlay in selection medium. Thymidine kinase deficient 
(TIC) recombinant viruses were selected and then piaque purified on fresh monolayers of human 1433 cells (TK- 
) in the presence of 25 jig/ml 5-bromo-2'-deoxyuridine. Stocks of purified recombinant HCV-vaccinia viruses 
were prepared by infecting either human 143 B or rabbit RK13 cells at an m.oi of 0.05 (Mackett et ai, 1988). 

2 0 The insertion of the HCV cDNA fragment in the recombinant vaccinia viruses was confirmed on an aliquot (50 

pi) of the cell lysate after the MPA selection by means of PCR with the primers used to clone the respective 
HCV fragments (see Table 1). The recombinant vaccinia-HCV viruses were named according to the vaccinia 
recombination plasmid number, e.g. the recombinant vaccinia virus wHCV-10A was derived from recombining 
the wild type WR strain with the pvHCV-10A plasmid. ' 

25 

Example 3: infection of cells with recombinant vaccinia viruses 

A confluent monolayer of RK13 cells was infected at a m.o.i. of 3 with the recombinant HCV-vaccinia 
viruses as described in example 2 . For infection, the ceil monolayer was washed twice with phosphate-buffered 

3 0 saline pH 7.4 (PBS) and the recombinant vaccinia virus stock was diluted in MEM medium. Two hundred ul of 

the virus solution was added per 10 6 cells such that the m.oi was 3, and incubated for 45 min at 24°C. The 
virus solution was aspirated and 2 ml of complete growth medium (see example 2) was added per 1 0 5 cells. The 
cells were incubated for 24 hr at 37°C during which expression of the HCV proteins took place. 
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Example 4: Analysis of recombinant proteins by means of western blotting 

The infected cells were washed two times with PBS, directly lysed with lysis buffer (50 mM Tris.HCI pH 
7.5, 150 mM NaCI, 1% Triton X-100, 5 mM MgCIz, 1 fig/ml aprotinin (Sigma, Bomem, Belgium)) or detached 
5 from the flasks by incubation in 50 mM Tris.HCL pH 7.5/ 10 mM EDTA/ 150 mM NaCI for 5 min, and collected by 
centrifugation (5 min at 1000g). The cell pellet was then resuspended in 200 yi lysis buffer (50 mM Tris.HCL pH 
8.0, 2 mM EDTA, 150 mM NaCI, 5 mM MgCb, aprotinin, 1% Triton X-100) per 10 6 celts. The cell lysates 
were cleared for 5 min at 14,000 rpm in an Eppendorf centrifuge to remove the insoluble debris. Proteins of 20 
\jd lysate were separated by means of sodium dodecyl sulphate-poiyacryiamide gel electrophoresis (SDS- 

10 PAGE). The proteins were then electro-transferred from the gel to a nitrocellulose sheet (Amersham) using a 
Hoefer HSI transfer unit cooled to 4°C for 2 hr at 100 V constant voltage, in transfer buffer (25 mM Tris.HCI pH 
8.0, 192 mM glycine, 20% (v/v) methanol). Nitrocellulose filters were blocked with Blotto (5 % (w/v) fat-free 
instant milk powder in PBS; Johnson et ai.. 1981) and incubated with primary antibodies diluted in Btotto/0.1 % 
Tween 20. Usually, a human negative control serum or serum of a patient infected with HCV were 200 times 

1 5 diluted and preincubated for 1 hour at room temperature with 200 times diluted wild type vaccinia virus-infected 
cell lysate in order to decrease the non-specific binding. After washing with Blotto/0.1% Tween 20, the 
nitrocellulose fitters were incubated with alkaline phosphatase substrate solution diluted in Blotto/0.1 % Tween 
20. After washing with 0.1% Tween 20 in PBS, the filters were incubated with alkaline phosphatase substrate 
solution (100 mM Tris.HCI pH 9.5, 100 mM NaCI, 5 mM MgCIz, 0,38 jig/ml nitroblue tetrazolium, 0.155 jig/mi 5- 

2 0 bromo-4-chloro-3-indolylphosphate). All steps, except the electrotransfer, were performed at room temperature. 

Example 5: Purification of recombinant E1 or E2 protein 
25 5.1. Lysis 

Infected RK13 cells (carrying E1 or E2 constructs) were washed 2 times with phosphate-buffered saline 
(PBS) and detached from the culture recipients by incubation in PBS containing 10 mM EDTA. The detached 
cells were washed twice with PBS and 1 ml of lysis buffer (50 mM Tris.HCI pH 7.5, 1 50 mM NaCI, 1% Triton X- 
30 100, 5 mM MgCfe, 1 jig/ml aprotinin (Sigma, Bomem, Belgium) containing 2 mM biotinylated N-ethylmaleimide 
(biotin-NEM) (Sigma) was added per 10 5 cells at 4°C. This lysate was homogenized with a type B douncer and 
left at room temperature for 0.5 hours. Another 5 volumes of lysis buffer containing 10 mM N-ethylmaleimide 
(NEM, Aldrich, Bomem, Belgium) was added to the primary lysate and the mixture was left at room temperature 
for 15 min. Insoluble cell debris was cleared from the solution by centrifugation in a Beckman JA-14 rotor at 
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14,000 rpm (30100 g at fmax) for 1 hour at 4°C. 

5.2. Lectin Chromatography 

5 

The cleared cell lysate was loaded at a rate of 1ml/min on a 0.8 by 10 cm Lentil-lectin Sepharose 48 
column (Pharmacia) that had been equilibrated with 5 column volumes of lysis buffer at a rate of 1ml/min. The 
lentil-lectin column was washed with 5 to 10 column volumes of buffer 1 (0.1M potassium phosphate pH 7.3, 
500 mM KCI, 5% glycerol, 1 mM 6-Nhfc-hexanoic acid, 1 mM MgCI 2f and 1% DecylPEG (KWANT, Bedum, The 

10 Netherlands). In some experiments, the column was subsequently washed with 10 column volumes of buffer 1 
containing 0.5% Empigen-BB (Calbiochem, San Diego, CA, USA) instead of 1% DecylPEG. The bound material 
was eluted by applying elution buffer (10 mM potassium phosphate pH 7.3, 5% glycerol, 1 mM hexanoic acid, 
1mM MgCfe, 0.5% Empigen-BB, and 0.5 M a-methyl-mannopyranoside). The eiuted materia) was fractionated 
and fractions were screened for the presence of E1 or E2 protein by means of ELiSA as described in example 6. 

15 Figure 22 shows ELiSA results obtained from lentil lectin eluate fractions of 4 different El purifications of cell 
lysates infected with wHCV39 (type 1b), wHCV40 (type 1b), wHCV62 (type 3a), and wHCV63 (type 5a). 
Figure 23 shows the profiles obtained from the values shown in Figure 22. These results show that (he lectin 
affinity column can be employed for envelope proteins of the different types of HCV. 

20 

5.3. Concentration and partial reduction 

The E1- or E2-positive fractions were pooled and concentrated on a Centricon 30 kDa (Amicon) by 
centrifugation for 3 hours at 5,000 rpm in a Beckman JA-20 rotor at 4°C. in some experiments the E1- or E2- 

25 positive fractions were pooled and concentrated by nitrogen evaporation. An equivalent- of 3. 10* cells was 
concentrated to approximately 200 jj. For partial reduction, 30% Empigen-BB (Calbiochem, San Diego, CA, 
USA) was added to this 200 yd to a final concentration of 3.5 %, and 1 M DTT in HaO was subsequently added to 
a final concentration of 1.5 to 7.5 mM and incubated for 30 min at 37 °C. NEM (1M in dimethylsulphoxide) was 
subsequently added to a final concentration of 50 mM and left to react for another 30 min at 37°C to block the 

3 0 free sulphydry) groups. 

5.4. Gel filtration chromatography 



A Superdex-200 HR 10/20 column (Pharmacia) was equilibrated with 3 column volumes PBS/3 e 
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Empigen-BB. The reduced mixture was injected in a 500 u) sample loop of the Smart System (Pharmacia) and 
PBS/3% Empigen-BB buffer was added for geifiltraiicn. Fractions of 250 ,ul were collected from Vo to Vt. The 
fractions were screened for the presence of El or E2 protein as described in example 6. 

Figure 24 shows EUSA results obtained from fractions obtained after geifiltration chromatography of 4 
5 different E1 purifications of cell lysates infected with wHCV39 (type 1 b), wHCV40 (type 1 b), wHCV62 (type 3a), 
and WHCV63 (type 5a). Figure 25 shows the profiles obtained from purifications of E1 proteins of types 1b, 3a, 
and 5a (from RK13 cells infected with wHCV39, wHCV62 t and wHCV63, respectively; purified on lentil lectin 
and reduced as in the previous examples). The peaks indicated with T, "2 1 , and *2\ represent pure El protein 
peaks (El reactivity manly in fractions 26 to 30). These peaks show very similar molecular weights of 

1 0 approximately 70 kDa, corresponding to dimeric E1 protein. Other peaks in the three profiles represent vaccinia 
virus and/or cellular proteins which could be separated from E1 only because of the reduction step as outlined in 
example 5.3. and because of the subsequent geifiltration step in the presence of the proper detergent As shown 
in Figure 26 pool 1 (representing fractions 10 to 17) and pool 2 (representing fractions 18 to 25) contain 
contaminating proteins not present in the E1 pool (fractions 26 to 30). The E1 peak fractions were ran on 

15 SDS/PAGE and blotted as described in example 4. Proteins labelled with NEM-biotin were detected by 
streptavidin-alkaline phosphatase as shown in Figure 27. It can be readily observed that, amongst others, the 
29 kDa and 45kOa contaminating proteins present before the gelfiitration chromatography (lane 1) are only 
present at very low levels in the fractions 26 to 30. The band at approximately 65kDa represents the E1 dimeric 
form that could not be entirely disrupted into the monomeric E1 form. Similar results were obtained for the type 

2 0 3a E1 protein (lanes 10 to 15), which shows a faster mobility on SDS/PAGE because of the presence of only 5 
carbohydrates instead of 6. Figure 28 shows a silver stain of an SDS/PAGE gel run in identical conditions as in 
Figure 26. A complete overview of the purification procedure is given in Figure 29. 

The presence of purified E1 protein was further confirmed by means of western blotting as described in 
example 4, The dimeric E1 protein appeared to be non-aggregated and free of contaminants. The subtype 1b. 

25 E1 protein purified from wHCV40-infected ceils according to the above scheme was aminoterminally sequenced 
on an 477 Perkins-Elmer sequencer and appeared to contain a tyrosine as first residue. This confirmed that the 
E1 protein had been cleaved by the signal peptidase at the correct position (between A191 and Y192) from its 
signal sequence. This confirms the finding of Hijikata et al. (1991) that the aminoterminus of the mature E1 
protein starts at amino acid position 1 92. 

30 

5.5. Purification of the £2 protein 

The E2 protein (amino acids 384 to 673) was purified from RK13 cells infected with wHCV44 as 
indicated in Examples 5.1 to 5.4. Figure 30 shows the ODzso profile (continuous line) of the lentil lectin 
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chromatography. The dotted line represents the E2 reactivity as detected by EUSA (see example 6). Figure 31 
shows the same profiles obtained from gelfiltration chromatography of the lentil-lectin E2 pool (see Figure 30), 
part of which was reduced and blocked according to the methods as set out in example 5.3., and part of which 
was immediately applied to the column. Both parts of the E2 pool were run on separate gelfiltration columns. It 
5 could be demonstrated that E2 forms covalently-linked aggregates with contaminating proteins if no reduction 
has been performed. After reduction and blocking, the majority of contaminating proteins segregated into the Vq 
fraction. Other contaminating proteins copurified with the E2 protein, were not covalentty linked to the E2 protein 
any more because these contaminants could be removed in a subsequent step. Figure 32 shows an additional 
Ni 2+ -IMAC purification step carried out for the E2 protein purification. This affinity purification step employs the 6 

10 histidine residues added to the E2 protein as expressed from wHCV44. Contaminating proteins either run 
through the column or can be removed by a 30 mM imidazole wash. Figure 33 shows a silver-stained 
SOS/PAGE of 0.5 fig of purified E2 protein and a 30 mM imidazole wash. The pure E2 protein could be easily 
recovered by a 200 mM imidazole elution step. Figure 34 shows an additional desalting step intended to remove 
imidazole and to be able to switch to the desired buffer, e.g. PBS, carbonate buffer, saline. 

1 5 Starting from about 50,000 cm 3 of RK1 3 cells infected with wHCV1 1 A (or wHCV40) for the production 

of E1 or WHCV41 , wHCV42. wHCV43. or wHCV44 for production of E2 protein, the procedures described in 
examples 5.1 to 5.5 ailow the purification of approximately 1 .3 mg of E1 protein and 0.6 mg of E2 protein. 

It should also be remarked that secreted E2 protein (constituting approximately 30-40%, 60-70% being 
in the intracellular form) is chracterized by aggregate formation (contrary to expectations). The same problem is 

2 0 thus posed to purify secreted E2. The secreted E2 can be purified as disclosed above. 



Example 6: EUSA for the detection of anti-E1 or anti-E2 antibodies or for the detection of E1 or E2 
proteins 

25 

Maxisorb microwell plates (Nunc, Roskilde, Denmark) were coated with 1 volume (e.g. 50 jj or 100 p) 
or 200 pJ) per weil of a 5 u.g/ml solution of Streptavidin (Boehringer Mannheim) in PBS for 16 hours at 4°C or for 
1 hour at 37°C. Alternatively, the wells were coated with 1 volume of 5 jig/ml of Gaianthus nivalis agglutinin 
(GNA) in 50 mM sodium carbonate buffer pH 9.6 for 16 hours at 4°C or for 1 hour at 37°C. In the case of coating 
3 0 with GNA, the plates were washed 2 times with 400 4J of Washing Solution of the Innotest HCV'Ab II! kit 
(Innogenetics, Zwijndrecht, Belgium). Unbound coating surfaces were blocked with 1.5 to 2 volumes of blocking 
solution (0.1% casein and 0.1% NaN 3 in PBS) for 1 hour at 37°C or for 16 hours at 4°C. Blocking solution was 
aspirated. Purified E1 or E2 was diluted to 100-1000 ng/ml (concentration measured at A = 280 nm) or column 
fractions to be screened for E1 or E2 (see example 5), or E1 or E2 in non-purified cell lysates (example 5.1.) 
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were diluted 20 times in blocking solution, and 1 volume of the El or E2 solution was added to each well and 
incubated for 1 hour at 37°C on the Streptavidin- or GNA-coated plates. The microwells were washed 3 times 
with 1 volume of Washing Solution of the Innotest HCV Ab III kit (Innogenetics, Zwijndrecht, Belgium). Serum 
samples were diluted 20 times or monxlonal anti-El or anti-E2 antibodies were diluted to a concentration of 20 
5 ng/ml in Sample Diluent of the Innotest HCV Ab III kit and 1 volume of the solution was left to react with the E1 
or E2 protein for 1 hour at 37°C. The microwells were washed 5 times with 400 of Washing Solution of the 
Innotest HCV Ab lit kit (Innogenetics, Zwijndrecht, Belgium). The bound antibodies were detected by incubating 
each well for 1 hour at 37°C with a goat anti-human or anti-mouse IgG, peroxidase-conjugated secondary 
antibody (DAKO, Glostwp, Denmark) diluted 1/80,000 in 1 volume of Conjugate Diluent of the Innotest HCV Ab 
10 III kit (Innogenetics, Zwijndrecht, Belgium), and color development was obtained by addition of substrate of the 
Innotest HCV Ab III kit (Innogenetics, Zwijndrecht, Belgium) diluted 100 times in 1 volume of Substrate Solution 
of the Innotest HCV Ab III kit (Innogenetics, Zwijndrecht, Belgium) for 30 min at 24 C C after washing of the plates 
3 times with 4Q0 }.d of Washing Solution of the Innotest HCV Ab 111 kit (innogenetics, Zwijndrecht, Belgium). 

15 

Example 7: Follow up of patient groups with different clinical profiles 
7.1. Monitoring of anti-E1 and anti-E2 antibodies 

20 The current hepatitis C virus (HCV) diagnostic assays have been developed for screening and 

confirmation of the presence of HCV antibodies. Such assays do not seem to provide information useful for 
monitoring of treatment or for prognosis of the outcome of disease. However, as is the case for hepatitis B, 
detection and quantification of anti-envelope antibodies may prove more useful in a clinical setting. To 
investigate the possibility of the use of anti-E1 antibody titer and anti-E2 antibody titer as prognostic markers for 

2 5 outcome of hepatitis C disease, a series of lFN-a treated patients with long-term sustained response (defined as 

patients with normal transaminase levels and negative HCV-RNA test (PCR in the 5* non-coding region) in the 
blood for a period of at least 1 year after treatment) was compared with patients showing no response or 
showing biochemical response with relapse at the end of treatment. 

A group of 8 lFN-a treated patients with long-term sustained response (LTR, follow up 1 to 3.5 years, 3 

3 0 type 3a and 5 type 1b) was compared with 9 patients showing non-complete responses to treatment (NR, -follow 

up 1 to 4 years, 6 type 1b and 3 type 3a). Type 1b (wHCV-39, see example 2.5.) and 3a E1 (wHCV-62, see 
example 2.5.) proteins were expressed by the vaccinia vims system (see examples 3 and 4) and purified to 
homogeneity (example 5). The samples derived from patients infected with a type 1b hepatitis C virus were 
tested for reactivity with purified type 1b E1 protein, while samples of a type 3a infection were tested for reactivity 
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of anti-type 3a El antibodies in an BJSA as desribed in example 6. The genotypes of hepatitis C viruses 
infecting the different patients were determined by means of the Inno-UPA genotyping assay (Innogenetics, 
Zwijndrecht. Belgium). Figure 5 shows the anti-61 signaMo-noise ratios of these patients followed during the 
course of interferon treatment and during the follow-up period after treatment. LTR cases consistently showed 
5 rapidly declining anti-E1 levels (with complete negativation in 3 cases), while anti-El levels of NR cases 
remained approximately constant Some of the obtained anti-E1 data are shown in Table 2 as average S/N 
ratios ± SD (mean anti-E1 titer). The anti-E1 titer could be deduced from the signal to noise ratio as show in 
Figures 5, 6, 7, and 8. 

Already at the end of treatment, marked differences could be observed between the 2 groups. Anti-E1 

10 antibody titers had decreased 6.9 times in LTR but only 1 .5 times in NR. At the end of follow up, the anti-E1 titers 
had declined by a factor of 22.5 in the patients with sustained response and even slightly increased in NR. 
Therefore, based on these data, decrease of 3nti-E1 antibody levels during monitoring of IFN-a therapy 
correlates with long-term, sustained response to treatment The anti-E1 assay may be very useful for prognosis 
of long-term response to I FN treatment, or to treatment of the hepatitis C disease in general. 

1 5 This finding was not expected. On the contrary, the inventors had expected the anti-E1 antibody levels 

to increase during the course of IFN treatment in patients with long term response. As is the case for hepatitis 8, 
the virus is cleared as a consequence of the seroconversion for anti-HBsAg antibodies. Also in many other virus 
infections, the virus is eliminated when anti-enveiope antibodies are raised. However, in the experiments of the 
present invention, anti-E1 antibodies clearly decreased in patients with a long-term response to treatment, while 

20 the antibody-level remained approximately at the same level in non-responding patients. Although the outcome 
of these experiments was not expected, this non-obvious finding may be very important and useful for clinical 
diagnosis of HCV infections. As shown in Figures 9, 10, 11, and 12, anti-E2 levels behaved very differently in the 
same patients studied and no obvious decline in titers was observed as for anti-E1 antibodies. Figure 35 gives a 
complete overview of the pilot study. 

25 As can be deduced from Table 2 r the anti-El titers were on average at least 2 times higher at the start 

. of treatment in long term responders compared with incomplete responders to treatment. Therefore, measuring 
the titer of anti-E1 antibodies at the start of treatment, or monitoring the patient during the course of infection and 
measuring the anti-El titer, may become a useful marker for clinical diagnosis of hepatitis C. Furthermore, the 
use of more defined regions of the E1 or E2 proteins may become desirable, as shown in example 7.3. 

30 

7.2. Analysis of E1 and E2 antibodies in a larger patient cohort 

The pilot study lead the inventors to conclude that, in case infection was completely cleared, antibodies 
to the HCV envelope proteins changed more rapidly than antibodies to the more conventionally studied HCV 
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antigens, with E1 antibodies changing most vigorously. We therefore included more type 1b and 3a-infected 
LTR and further supplemented the cohort with a matched series of NR, such that both groups included 14 
patients each. Some partial responders (PR) and responders with relapse (FIR) were also analyzed. 

Figure 36 depicts average El antibody (EUb) and E2 antibody (E2Ab) levels in the LTR and NR 
5 groups and Tables 4 and 5 show the statistical analyses. In this larger cohort, higher E1 antibody levels before 
IFN-cc therapy were associated with LTR (P < 0.03). Since much higher E1 antibody levels were observed in 
type 3a-inr*ected patients compared with type 1b-infected patients (Figure 37), the genotype was taken into 
account (Table 4). Within the type lb-infected group, LTR also had higher E1 antibody levels than NR at the 
initiation of treatment (P < 0.05]; the limited number of type 3a-infected NR did not allow statistical analysis. . 

10 Of antibody levels monitored in LTR during the 1 .5-year follow up period, only E1 antibodies cleared 

rapidly compared with levels measured at initiation of treatment [P = 0.0058, end of therapy; P = 0.0047 and P = 
0.0051 at 6 and 12 months after therapy, respectively]. This clearance remained significant within type 1- or type 
3-infected LTR (average P values < 0.05). These data confirmed the initial finding that ElAb levels decrease 
rapidly in the early phase of resolvement. This feature seems to be independent of viral genotype. In NR, PR, or 

15 RR, no changes in any of the antibodies measured were observed throughout the follow up period. In patients 
who responded favourably to treatment with normalization of ALT levels and HCV-RNA negative during 
treatment, there was a marked difference between sustained responders (LTR) and responders with a relapse 
(RR). In contrast to LTR, RR did not show any decreasing E1 antibody levels, indicating the presence of occult 
HCV infection that could neither be demonstrated by PCR or other classical techniques for detection of HCV- 

2 0 RNA, nor by raised ALT levels. The minute quantities of viral RNA, still present in the RR group during treatment, 
seemed to be capable of anti-El B cell stimulation. Anti-E1 monitoring may therefore not only be able to 
discriminate LTR from NR, but also from RR. 

25 7.3. Monitoring of antibodies of defined regions of the E1 protein 

Although the molecular biological approach of identifying HCV antigens resulted in unprecedented 
breakthrough in the development of viral diagnostics, the method of immune screening of Xgt1 1 libraries 
predominantly yielded linear epitopes dispersed throughout the core and non-structural regions, and analysis of 
30 the envelope regions had to await cloning and expression of the E1/E2 region in mamrnaiian cells. This 
approach sharply contrasts with many other viral infections of which epitopes to the envelope regions had 
already been mapped long before the deciphering of the genomic structure. Such epitopes and corresponding 
antibodies often had neutralizing activity useful for vaccine development and/or allowed the development of 
diagnostic assays with clinical or prognostic significance (e.g. antibodies to hepatitis B surface antigen). 
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As no HCV vaccines or tests allowing clinical diagnosis and prognosis of hepatitis C disease are available today, 
the characterization of viral envelope regions exposed to immune sur/eillance may significantly contribute to 
new directions in HCV diagnosis and prophylaxis. 

Several 20-mer peptides (Table 3) that overlapped each other by 8 amino acids, were synthesized 
5 according to a previously described method (EP-A-0 489 968) based on the HC-J1 sequence (Okamoto et ai„ 
1990). None of these, except peptide env35 (also referred to as El -35), was able to detect antibodies in sera of 
approximately 200 HCV cases. Only 2 sera reacted slightly with the env35 peptide. However, by means of the 
anti-E1 ELISA as described in example 6, it was possible to discover addtionai epitopes as follows: The anti-El 
ELISA as described in example 6 was modified by mixing 50 \iqlm\ of E1 peptide with the 1/20 diluted human 

10 serum in sample diluent. Figure 13 shows the results of reactivity of human sera to the recombinant E1 
(expressed from wHCV-40) protein, in the presence of single or of a mixture of E1 peptides. While only 2% of 
the sera could be detected by means of E1 peptides coated on strips In a Line Immunoassay format, over half of 
the sera contained anti-E1 antibodies which could be competed by means of the same peptides, when tested on 
the recombinant E1 protein. Some of the murine monoclonal antibodies obtained from Balb/C mice after 

1 5 injection with purified E1 protein were subsequently competed for reactivity to El with the single peptides (Figure 
14). Clearly, the region of env53 contained the predominant epitope, as the addition of env53 could substantially 
compete reactivity of several sara with E1, and antibodies to the env31 region were also detected. This finding 
was surprising, since the env53 and env31 peptides had not shown any reactivity when coated directly to the 
solid phase. 

20 Therefore peptides were synthesized using technology described by applicant previously (in WO 

93/18054). The following peptides were synthesized: 
peptide env35A-biotin 

NHz-SNSSEMDMIMHTPGCV-GKbiotin (SEQ ID NO 51) 

spanning amino acids 208 to 227 of the HCV polyprotein in the E1 region 

2 5 peptide biotin-env53 ('epitope A') 

biotin-GG-ITGHRMAWDMMMNWSPTTAL-COOH (SEQ ID NO 52) 
spanning amino acids to 313 of 332 of the HCV polyprotein in the E1 region 
peptide 1bE1 ('epitope B') 

HzN-YEVRNVSGlYHVTNDCSNSSIVYEAADMIMHTPGCGK -biotin (SEQ ID NO 53) 

3 0 , spanning amino acids 1 92 to 228 of the HCV polyprotein in the E1 region 

and compared with the reactivities of peptides E1a-88 (biotin-GG-TPTVATRDGKLPATQLRRHIDLL, SEQ 10 
NO 54) and E1b-8B (biotin-GG-TPTLAARDASVPTTTlRRHVDLL, SEQ ID NO 55) which are derived from the 
same region of sequences of genotype 1a and 1b respectively and which have been described at the IXth 
international virology meeting in Glasgow, 1993 ('epitope C). Reactivity of a panel of HCV sera was tested on 



02/055548 



52 



PCT7EP02/00219 



epitopes A, B and C and epitope B was also compared with env35A (of 47 HCV-positive sera, 8 were positive on 
epitope B and none reacted with env35A). Reactivity towards epitopes A ? B t and C was tested directly to the 
biotinylated peptides (50 jig/ml) bound to streptavidin-coated plates as described in example 6. Clearly, epitopes 
A and B were most reactive while epitopes C and env35A-biotin were much less reactive. The same series of 
patients that had been monitored for their reactivity towards the complete E1 protein (example 7.1.) was tested 
for reactivity towards epitopes A, B, and C. Little reactivity was seen to epitope C, while as shown in Figures 15, 
16, 17, and 18, epitopes A and B reacted with the majority of sera. However, antibodies to the most reactive 
epitope (epitope A) did not seem to predict remission of disease, while the anti-1bE1 antibodies (epitope B) were 
present almost exclusively in long term responders at the start of IFN treatment. Therefore, anti-1 bE1 (epitope B) 
antibodies and anti-env53 (epitope A) antibodies could be shown to be useful markers for prognosis of hepatitis 
C disease. The env53 epitope may be advantageously used for the detection of cross-reactive antibodies 
(antibodies that cross-react between major genotypes) and antibodies to the env53 region may be very useful 
for universal E1 antigen detection in serum or liver tissue. Monoclonal antibodies that recognized the env53 
region were reacted with a random epitope library. In 4 clones that reacted upon immunoscreening with the 
monoclonal antibody 5E1 A10, the sequence -GWD- was present. Because of its analogy with the universal HCV 
sequence present in all HCV variants in the env53 region, the sequence AWD is thought to contain the essential 
sequence of the env53 cross-reactive murine epitope. The env31 clearly also contains a variable region which 
may contain an epitope inihe amino terminal sequence -YQVRNSTGL- (SEQ ID NO 93) and may be useful for 
diagnosis. Env31 or El-31 as shown in Table 3, is a part of the peptide 1bE1. Peptides E1-33 and E1-51 also 
reacted to some extent with the murine antibodies, and peptide E1-55 (containing" the variable region 6 (V6); 
spanning amino acid positions 329-336) also reacted with some of the patient sera 

Anti-E2 antibodies clearly followed a different pattern than the anti-El antibodies, especially in patients 
with a long-term response to treatment. Therefore, it is clear that the decrease in anti-envelope antibodies could 
not be measured as efficiently with an assay employing a recombinant E1/E2 protein as with a single anti-E1 or 
anti-E2 protein. The anti-E2 response would clearly blur the anti-El response in an assay measuring both kinds 
of antibodies at the same time. Therefore, the ability to test anti-envelope antibodies to the single E1 and E2 
proteins, was shown to be useful. 

7,4. Mapping of anti-E2 antibodies 

Of the 24 anti-E2 Mabs only three could be competed for reactivity to recombinant E2 by peptides, two 
of which reacted with the HVRI region (peptides E2-67 and E2-69, designated as epitope A) and one which 
recognized an epitope competed by peptide E2-13B (epitope C). The majority of murine antibodies recognized 
conformational anti-E2 epitopes (Figure 19). A human response to HVRI (epitope A), and to a lesser extent 
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HVRII (epitope B) and a third linear epitope region (competed by peptides E2-23, E2-25 or E2-27, designated 
epitope E) and a fourth linear epitope region (competed by peptide E2-17B, epitope D) could also frequently be 
observed, but the majority of sera reacted with conformational epitopes (Figure 20). These conformational 
epitopes could be grouped according to their relative positions as follows: the IgG antibodies in the supernatant 
5 of hybridomas 15C8C1, 12D11F1, 9G3E6,8G1GD1H9, 10D3C4, 4H6B2, 17F2C2, 5H6A7, 15B7A2 recognizing 
conformational epitopes were purified by means of protein A affinity chromatography and t mg/ml or the 
resulting IgG's were biotinylated in borate buffer in the presence of biotin. Biotinylated antibodies were separated 
from free biotin by means of gelfiltration chromatography. Pooled biotinylated antibody fractions were diluted 100 
to 10,000 times. E2 protein bound to the solid phase was detected by the biotinylated IgG in the presence of 100 
10 times the amount of non-biotinylated competing antibody and subsequently detected by alkaline phosphatase 
labeled streptavidin. 

Percentages of competition are given in Table 6. Based on these results, 4 conformational anti-E2 
epitope regions (epitopes F, G, H and I) could be delineated (Figure 38). Alternatively, these Mabs may 
recognize mutant linear epitopes not represented by the peptides used in this study. Mabs 4H6B2 and 10D3C4 
15 competed reactivity of 16A6E7, but unlike 16A6E7, they did not recognize peptide E2-13B. These Mabs may 
recognize variants of the same linear epitope (epitope C) or recognize a conformational epitope which is 
sterically hindered or changes conformation after binding of 1 6A5E7 to the E2-1 38 region (epitope H). 

20 



Example 8: E1 alycosvlation mutants 

25 

8.1. Introduction 

The E1 protein encoded by wHCV1 OA, and the E2 protein encoded by wHCV41 to 44 expressed from 
mammalian cells contain 6 and 11 carbohydrate moieties, respectively. This could be shown by incubating the 
3 0 lysate of wHCV1 OA-infected or wHCV44-infected RK13 cells with decreasing concentrations of glycosidases 
(PNGase F or Endoglycosidase H, (Boehringer Mannhein Biochemica) according to the manufacturer's 
instructions), such that the proteins in the lysate (including E1) are partially deglycosylated (Fig. 39 and 40, 
respectively). 

Mutants devoid of some of their glycosylation sites could allow the selection of envelope proteins with 
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improved immunological reactivity. For HIV for example, gp120 proteins lacking certain selected sugar-addition 
motifs, have been found to be particularly useful for diagnostic or vaccine purpose. The addition of a new 
oligosaccharide side chain in the hemagglutinin protein of an escape mutant of the A/Hong Kong/3/68 (H3N2) 
influenza virus prevents reactivity with a neutralizing monoclonal antibody (Skehel et al, 1984). When novel 
5 glycosylate sites were introduced into the influenza hemaglutinin protein by site-specific mutagenesis, dramatic 
antigenic changes were observed, suggesting that the carbohydrates serve as a modulator of antigenicity 
(Gallagher et al., 1988). In another analysis, the 8 carbohydrate-addition motifs of the surface protein gp70 of the 
Friend Murine Leukemia Virus were deleted. Although seven of the mutations did not affect virus infectivity, 
mutation of the fourth glycosylate signal with respect to the amino terminus resulted in a non-infectious 

10 phenotype (Kayman et al., 1991). Furthermore, ft is known in the art that addition of N-linked carbohydrate 
chains is important for stabilization cf folding intermediates and thus for efficient folding, prevention of malfolding 
and degradation in the endoplasmic reticulum, oiigomerization, biological activity, and transport of glycoproteins 
(see reviews by Rose et al., 1988: Dcms et al., 1993; Helenius, 1994). 

After alignment of the different envelope protein sequences of HCV genotypes, it may be inferred that 

1 5 not all 6 glycosylate sites on the HCV subtype 1 b E1 protein are required for proper folding and reactivity, since 
some are absent in certain (sub)iypss. The fourth carbohydrate motif (on Asn251), present in types 1b, 6a, 7, 8, 
and 9, is absent in all other types know today. This sugar-addition motif may be mutated to yield a type 1b E1 
protein with improved reactivity. Also :he rype 2b sequences show an extra glycosylate site in the V5 region 
(on Asn299). The isolate S83, belonging to genotype 2c, even lacks the first carbohydrate motif in the V1 region 

2 0 (on Asn), while it is present on all other isolates (Stuyver et al., 1994) However, even among the completely 
conserved sugar-addition motifs, the presence of the carbohydrate may not be required for folding, but may 
have a role in evasion of immune surveillance. Therefore, identification of the carbohydrate addition motifs which 
are not required for proper folding (and reactivity) is not obvious, and each mutant has to be analyzed and tested 
for reactivity. Mutagenesis of a glycosylate motif (NXS or NXT sequences) can be achieved by either mutating 

25 the codons for N, S, or T, in such a way that these codons encode amino acids different from N in the case of N, 
and/or amino acids different from S or T in the case of S and in the case of T. Alternatively, the X position may 
be mutated into P, since it is known that NFS or NPT are not frequently modified with carbohydrates. After 
establishing which carbohydrate-addition motifs are required for folding and/or reactivity and which are not, 
combinations of such mutations may be made. 

30 

8.2. Mutagenesis of the E1 protein 

Al! mutations were performed on the E1 sequence of clone HCCI1 OA (SEQ ID NO. 5), The first round of 
PCR was performed using sense primer 'GPT (see Table 7) targetting the GPT sequence located upstream of 
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the vaccinia 11K late promoter, and an antisensa primer (designated GLY#, with £ representing the number of 
the gfycosylation site, see Fig. 41) containing the desired base change to obtain the mutagenesis. The six GLY# 
primers (each specific tor a given glycosyiation site) were designed such that: 

- Modification of the codon encoding for the N-glycosylated Asn (AAC or AAT) to a Gin codon (CAA or CAG). 
5 Giutamine was chosen because it is very similar to asparagine (both amino acids are neutral and contain non- 
polar residues, giutamine has a longer sice chain (one more -CH2- group). 

- The introduction of silent mutations in one or several of the codons downstream of the glycosyiation site, in 
order to create a new unique or rare (e.g. a second Smal site for E1GIy5) restriction enzyme site. Without 
modifying the amino acid sequence, this mutation will provide a way to distinguish the mutated sequences from 

10 the original E1 sequence (pvHCV-10A) or from each other (Figure 41) . This additional restriction site may also 
be useful for the construction of new hybrid (double, triple, etc.) glycosyiation mutants. 
- 18 nucleotides extend 5' of the first mismatched nucleotide and 12 to 18 nucleotides extend to the 3* end. Table 
7 depicts the sequences of the six GLY# primers overlapping the sequence of N-linked glycosyiation sites. 

For site-directed mutagenesis, the 'mispriming 1 or 'overlap extension* (Horton, 1993) was used. The 

15 concept is illustrated in Figures 42 and 43. First, two separate fragments were amplified from the target gene for 
each mutated site. The PCR product obtained from the 5' end (product GLYt) was amplified with the 5' sense 
GPT primer (see Table 7) and with the respective 3' antisense GLY# primers. The second fragment (product 
OVR#) was amplified with the 3' antisense Tfa primer and the respective 5 sense primers (OVR# primers, see 
Table 7, Figure 43). 

2 0 The OVR# primers target part of the GLY# primer sequence. Therefore, the two groups of PCR 

products share an overlap region of identical sequence. When these intermediate products are mixed (GLY-1 
with OVR-i, GLY-2 with OVR-2, etc.), melted at high temperature, and reannealed, the top sense strand of 
product GLY# can anneal to the antisense strand of product OVR# (and vice versa) in such a way that the two 
strands act as primers for one another (see Fig. 42.3.). Extension of the annealed overlap by Taq polymerase 

2 5 during two PCR cycles created the full-length mutant molecule E1Gly#, which carries the mutation destroying 

the glycosyiation site number S. Sufficient quantities of the E1GLY& products for cloning were generated in a 
third PCR by means of a common, set of two internal nested primers. These two new primers are respectively 
overlapping the 3' end of the vaccinia 1 1K promoter (sense GPT-2 primer) and the 5' end of the vaccinia 
thymidine kinase locus (antisense TKa-2 primer, see Table 7). All PCR conditions were performed as described 

3 0 inStuyveretal.(1993). 

Each of these PCR products was cloned by EcoRI/BamHI cleavage into the EcoRI/BamHI-cut vaccinia 
vector containing the original E1 sequence (pvHCV-10A). 

The selected clones were analyzed for length of insert by EcoRl/BamH 1 cleavage and for the presence 
of each new restriction site. The sequences overlapping the mutated sites were confirmed by double-stranded 
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sequencing. 

8,3, Analysis of E1 qlvcosvlation mutants 

5 Starting from the 6 plasmids containing the mutant E1 sequences as described in example 8.2, 

recombinant vaccinia viruses were generated by recombination with wt vaccinia virus as described in example 
2.5. Briefly, 175 cm 2 -flasks of subconfluent RK13 cells were infected with the 6 recombinant vaccinia viruses 
carrying the mutant E1 sequences, as well as with the wHCV-10A (carrying the non-mutated El sequence) and 
wt vaccinia viruses. Cells were lysed after 24 hours of infection and analyzed on western blot as described in 

10 example 4 (see Figure 44A). Ail mutants showed a faster mobility (corresponding to a smaller molecular weight 
of approximately 2 to 3 kDa) on SDS-PAGE than the original E1 protein; confirming that one carbohydrate 
moiety was not added. Recombinant viruses were also analyzed by PGR and restriction enzyme analysis to 
confirm the identity of the different mutants. Figure 448 shows that all mutants (as shown in Figure 41) 
contained the expected additional restriction sites. Another part of the cell iysate was used to test the reactivity of 

15 the different mutant by ELISA. The lysates were diluted 20 times and added to microwel! plates coated with the 
lectin GNA as described in example 6. Capturec (mutant) E1 glycoproteins were left to react with 20-times 
diluted sera of 24 HCV-infected patients as described in example 6. Signal to noise (S/N) values (00 of 
GLY#/OD of wt) for the six mutants and E1 are snovm in Table 8. The table also shows the ratios between S/N 
values of GLY# and E1 proteins. It should be uncsrstcod that the approach to use ceil lysates of the different 

20 mutants for comparison of reactivity with patient sera may result in observations that are the consequence of 
different expression levels rather then reactivity levels. Such difficulties can be overcome by purification of the 
different mutants as described in example 5, and by testing identical quantities of all the different E1 proteins. 
However, the results shown in table 5 already indicate that removal of the 1st (GLY1), 3rd (GLY3), and 5th 
(GLY6) glycosylate motifs reduces reactivity of some sera, while removal of the 2nd and 5th site does not. 

25 Removal of GLY4 seems to improve the reactivity of certain sera. These data indicate that different patients 
react differently to the glycosylate mutants of the present invention. Thus, such mutant E1 proteins may be 
useful for the diagnosis (screening, confirmation, prognosis, etc.) and prevention of HCV disease, • 

30 Example 9: Expression of HCV E2 protein in qtvcosvlation-deficient yeasts 

The E2 sequence corresponding to done HCCL41 was provided with the a-mating factor pre/pro 
signal sequence, inserted in a yeast expression vector and S. cerevisiae cells transformed with this construct 
secreted E2 protein into the growth medium. It was observed that most glycosylation sites were modified with 
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high-mannose type glycosylates upon expression of such a construct in S. cerevisiae strains (Figure 45). This 
resulted in a too high level of heterogeneity and in shielding of reactivity, which is not desirable for either vaccine 
or diagnostic purposes. To overcome this problem, S. cerevisiae mutants with modified glycosylation pathways 
were generated by means of selection of vanadate-resistant clones. Such clones were analyzed for modified 
5 glycosylation pathways by analysis of the molecular weight and heterogeneity of the glycoprotein invertase. This 
allowed us to identify different glycosylation deficient S. cerevisiae mutants. The E2 protein was subsequently 
expressed in some of the selected mutants and left to react with a monoclonal antibody as described in example 
7, on western blot as described in example 4 (Figure 46). 

10 Example 10. General utility 

The present results show that not only a good expression system but also a good purification protocol 
are required to reach a high reactivity of the HCV envelope proteins with human patient sera. This can be 
obtained using the proper HCV envelope protein expression system and/or purification protocols of the present 

1 5 invention which guarantee the conservation of the natural folding of the protein and the purification protocols of 
the present invention which guarantee the elimination of contaminating proteins and which preserve the 
conformation, and thus the reactivity of the HCV envelope proteins. The amounts of purified HCV envelope 
protein needed tor diagnostic screening assays are in the range of grams per year. For vaccine purposes, even 
higher amounts of envelope protein would be needed. Therefore, the vaccinia virus system may be used for 

2 0 selecting the best expression constructs and for limited upscaling, and large-scale expression and purification of 
single or specific oligomeric envelope proteins containing high-mannose carbohydrates may be achieved when 
expressed from several yeast strains. In the case of hepatitis B for example, manufacturing of HBsAg from 
mammalian cells was much more costly compared with yeast-derived hepatitis B vaccines. 

The purification method dislcosed in the present invention may also be used for Viral envelope 

2 5 proteins* in general. Examples are those derived from Raviviruses, the newly discovered GB-A, GB-B and GB-C 

Hepatitis viruses, Pestiviruses (such as Bovine viral Diarrhoea Virus (BVDV), Hog Cholera Virus (HCV), Border 
Disease Virus (BDV)), but also less related virusses such as Hepatitis 8 Virus (mainly for the purification of 
HBsAg), 

The envelope protein purification method of the present invention may be used for intra- as well as 

3 0 extracellularly expressed proteins in lower or higher eukaryotic cells or in prokaryotes as set out in the detailed 

description section. 

Example 11. Demonstration of Prophylactic and Therapeutic Utility 
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Liver disease in chimpanzees chronically infected with HCV can be reduced by immunization with Et . Multiple 
immunizations, however, were required In order to reach a significant immune response. One of ordinary skill will 
appreciate that viral persistence is produced with immune modulation which is either orchestrated by the virus itself or by 
the host In order to analyze if such an immune modulation does exist in HCV, the immune responses against El and 
5 NS3 in naive and chronically infected chimpanzees were compared. Since a lower response in the chronically infected 
animals was anticipated, this group of animals was selected for a more rigorous immunization schedule including the 
following: use of an adjuvant proven in mice to be more potent for inducing cellular responses {Table 9) compared to 
alum, which was the adjuvant used for naive animals; and the immunization schedule for chronically infected animals 
consisted of 12 immunizations compared to 6 for naive animals (Fig. 47). 

1 0 Although the number of immunized animals does not allow statistical analysis, the following clear tendency can 

be detected in the humoral responses (Table 10): the number of immunizations for seroconversion is lower in naive 
animals; and the magnitude of the immune response is substantially greater in the naive animals, 2/3 infected animals 
do not reach the level of 10 internal units, even after 12 immunizations. 

The analysis of the cellular responses, after three immunizations, reveais an even larger difference (Fig. 48a- 

15 d), including the following: El -specific T-cell proliferation is almost aosent in the chronically infected animals, while a 
clear stimulation can be seen in the naive setting; IL-2 measurements confirmed that the low stimulation of the T-cell 
compartment in chronic carriers; and, a clear 7h2 (1L-4) response in naive animals is induced, as expected for an alum- 
adjuvant containing vaccine. 

This confirms that at least E1 immunization provides a prophylactic effect in naive animals and suggest that E2 

2 C and/or combinations of E1 and E2 proteins and/or peptides may provide useful therapeutic and/or prophylactic benefits 
in naive animals. 

The 'impairment 1 to induce both cellular and humoral responses against an HCV El antigen can be only 
partially overcome by multiple immunizations, as demonstrated by the following results: an increase in antibody titer 
after each injection was noted but the levels as in naive animals were not reached in 2/3 animals; and the T-cell 

2 5 proliferative responses remain very low (Fig. 49). The ELISPOT results show, however, a minor increase in IL-2 (not 

shown), no change in IFN-g (not shown) and an increase in 11-4 (Fig. 49) which indicates that Th2 type responses are 
more readily induced. 11-4 was noted to remain at a low level compared to the level reached after three immunizations in 
naive animals. 

A quite similar observation was made for NS3 immunizations where an even stronger adjuvant (RIBI) was used 

3 0 in the chronic chimpanzee. As compared with an alum formulation in naive animals the fallowing has been noted: the 

induced antibody titers are comparable in both groups (not shown); and both cytokine secretion and T-cell proliferation 
are almost absent in the chronic animals compared to the responses in naive animals (Fig. 49a-b). 

Currently there have been some indications that Immune responses against HCV in chronic carriers are low or 
at least insufficient to allow clearance of infection. The above results support the hypothesis that the immune system of 
3 5 HCV chronic carriers may be impaired and that they do not respond to HCV antigens as efficiently as in a naive situation. 

in a study by Wiedmann et al., (Hepatology 2000; 31 : 230-234), vaccination for HBV was less effective in HCV 
chronic carriers, which indicates that such an immune impairment is not limited to HCV antigens. Oe Maria et al. 
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(Hepatology 2000; 32 444-445) confirmed these data and have proposed adapted vaccine dosing regimens for HCV 
patients. The data presented herein indicates that increasing the number of immunizations may indeed augment 
humoral responses but that cellular (especially Th1 ) responses are difficult to induce, even when powerful adjuvants are 
used. It may be advantageous to begin immunization ar the time of antiviral therapy, when the immune system is more 
5 prone to respond. 
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Table 1: Recombinant vaccinia plasmids and viruses 



Plasmid name 


Name 


cONA subclone construction • 


Length (nt/aa) 


Vector used 
for insertion 


pvHCV-13A 


E1s 


EcoR I - Hind III 


472/157 


pgptATA-18 


pvHCV-12A 


E1s 


EcoR 1 - Hind ill 


472/158 


pgptATA-18 


pvHCV-9A 


E1 


EcoR 1 - Hind 111 


631/211 


pgptATA-18 


pvHCV-11A 


E1s 


EcoR 1 - Hind (11 


625/207 


pgptATA-18 


pvHCV-17A . 


E1s 


EcoR 1 - Hind 111 


625/208 


pgptATA-18 


pvHCV-1QA ' 


E1 


EcoR 1- Hind III 


783/262 


pgptATA-18 ■ 


pvHCV-18A 


" COREs 


Acc I (KI) - EcoR 1 (Kl) 


403/130 


pgptATA-18 


pvHCV-34 


CORE 


Acc 1 (Kl) - Fsp 1 


595/197 . 


pgptATA-1 8 


pvHCV-33 


CORE-E1 


Acc 1 (Kl) 


1150/380 


pgptATA-18 


pvHCV-35 


CORE-E1b.his 


EcoR 1 - BamH 1 (Kl) 


1032/352. 


pMS-66 


pvHCV-36 


CORE-E1ruhts 


EcoR i - Nco I (KI) 


1106/376 


pMS-66 


pvHCV-37 


E1a 


Xma 1 - BamH 1 


' 711/239 


pvHCV-10A 


pvHCV-38 


EUs 


EcoR 1 - BstE II 


553/183 


pvHCV-11A 


pvHCV-39 


ElAb 


EcoR I - BamH I 


960/313 


pgsATA-1 8 


pvHCV-40 


EUb.his 


■ EcoR 1 - BamH I (Kl) 


* 960/323 


pMS-66 


pvHCV-41 


E2bs 


BamH 1 (KI)-AlwN 1 (T4) 


1005/331 


pgsATA-18 


pvHCV-42 


E2bs.hls 


BamH 1 (W)-AlwN 1 (T4) 


1005/341 


pMS-66 


pvHCV-43 


E2ns 


Nco 1 (Kl) - AiwN 1 (T4) 


932/314 


pgsATA-18 


pvHCV-44 


E2ns.his 


Nco 1 (KI) - AlwN 1 (T4) 


932/321 . 


pMS-66 


- pvHCV-62 . 


E1s(type 3a) 


EcoR 1 - Hind ill 


625/207 


pgsATA-18 


pvHCV-63 


E1s(type5) 


EcoR 1- Hind III 


625/207 


pgsATA-18 


pvHCV-64 


E2 


BamH 1 - Hind III 


1410/463 


pgsATA-18 


pvHCV-65 


E1-E2 


BamH 1 - Hind II! 


2072/691 


pvHCV-10A 


pvHCV-66 


CORE-E1-E2 


BamH 1 - Hind HI 


2427/809 


pvHCV-33 



nt: nucleotide aa: amlnoacid Kl; Klenow DNA Pol filling T4: T4 DNA Pol filling 
Position: aminoacid position in the HCV polyprotein sequence 
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Table 1 - continued: Recombinant vaccinia plasmids and viruses 



Plasmid 




HCVcDNA subclone 




Vector 


Name 


Name 


Construction 


Length 
(nt/aa) 


used for 
insertion 


pvHCV-81 


E1*-GLY 1 


EcoRI - BamH I 


783/262 


pvHCV-10A 


pvHCV-82 


E1M3LY2 


EcoRI - BamH 1 


783/262 


pvHCV-10A 


pvHCV-83 


ET-GLY 3 


EcoRI - BamH 1 


783/262 


pvHCV-10A 


pvHCV-84 . 


E1*-GLY4 


, EcoRI -BamH 1 


783/262 " 


pvHCAMOA 


pvHCV-85 


E1M3LY5 


EcoRI - BamH 1 


783/262 


pvHCV-1QA 


pvHCV-86 


ET-GLY 6 


EcoRI - BamH 1 


783/262 


pvHCV-10A 



nt: nucleotide aa: aminoacid Kl: Klencw DNA Pol ffllingT4: T4 DNA Pol filling 
Position: aminoacid position in the HCV polyprotein sequence 
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Table 2 : Summary of anti-E1 tests 
S/N + SD (mean anti-E1 titer) 





Start of treatment 


End of treatment 


Follow-up 


I LTR 


6.94 + 2.29(1.-3946) 


4.4fl + 2.69 (1:568) 


259 + 2.69 (1:175) 


[nr 


5.77 + 177(1:1607) * 


5.29 ±3.99 (1:1060) 


6.08 + 3.73 (1:1978) 



LTR : Long-term, sustained response for more than .1 year 
NR : No response, response with relapse, or partial response 
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Table 3 

Synthetic peptides for competition studies 



PROTEIN 
E1 



E2 



PEPTIDE 


AMINU ALIO bcUUcNUc 




SEQ ID NO 


E1-31 


LLSCLTVPASAYQVRNSTGl 


' 181-200 


56 


E1-33 


QVRNSTGLYHVTNDCPNSSI 


193-212 


0/ 


fci-oo 


wnrpw^^iwPAHnAH htp 

IHUXjilMOOl V I unnUAiu 1 1 r 


• 205-224 


58 


£TH OCA 


^M^IWPAAnMIMHTPGHV 

ulMOOl V I uArVLUVillVin 1 ruwY 


208-227 


'59 


t1-o7 


nUWi-Ti 1 rUuVr\jVnC\31iVO 


217-236 


60 


El-39 


UVMCurJVonUvv V Mm i r 1 Vrt i 




• 61 


E1-41 


A MTOT\//lTOnr k U r l OATAI DO . 

AMTr 1 VATRUCaKLr A 1 Ulnn 




62 


El -43 


U^ATQLRmlDLLVCaoATLU 




uo 


E1-45 


LV6SATLCSALYVG0LCGSV ■ 




u*f 


E1-49 


QUTTSPRRHWTTQGCNCSI 




fin 


E1-51 


TQGCNCS1YPGHITGHRMAW 


GUI -viCU 


fin 
UO 


E1-53 


ITGHRMAWDMMMNWSPTAAL 


OH *3 OOO 

31 o-doii 


R7 


El-55 


NWSPTAALVMAQLLRlPQAi 


325-344 


68 


E1-57 


LLRIPQA1LDM1AGAHWGVL 


337-356 


69 


61-59 


AGAHWGVLAGIAYFSMVGNM 


349-368 


70 


E1-63 


WLLLFAGVOAETIVSGGQA 


373-392 


71 


E2-67 


SGIVSUTPGAKQNIQUNT 


397-416 


72 


E2-69 


QNIQU^GSWHINSTALN 


409-428 


73 


E2-S3B 


LNCNESLNTGWWLAGUYQHK 


427-446 


74 


E2-$1B 


AGUYGHKFNSSGCPERLAS 


439458 


75 


E2-1B 


GCPERlASCRPLTDFDQGWG 


451470 


76 ' 


E2-3B 


TDFDQGWGPISYANGSGPOQ 


463482 


77 


E2-5B 


ANGSGPDQRPYCWHYPPKPC 


475494 


78 


E2-7B 


WHYPPKPCG1VPAKSVCGPV 


487-506 


79 


E2-9B 


AKSVCGPVYCFTPSPVWGT 


499-518 


80 


E2-118 


PSPWVGTTDRSGAPTYSWG 


511-530 


81 


E2-13B 


GAPTYSWGENDTDVFVLNNT 


• 523-542 


82 
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E2-17B 


GNWFGCTWMNSTGFTKVCGA 


547-566 


83 


E2-19B 


GrTKVCGAPPVCIGGAGNNT 


559-578 


84 


E2-21 ' 


IGGAGNNTIHCPTDCFRKHP 


571-590 


85 


E2-23 


TDCFRKHPDATYSRCGSGPW 


583-602 


86 


E2-25 


SRCGSGPWITPRCLVDYPYR 


595-614 


87 


E2-27 


CLVDYPYRLWHYPCTINYTI 


607-626 • 


88 


E2-29 


PCT1NYT1FK1RMYVGGVEH 


619-638 


89 


E2-31 


MYVGGVBiRLEAACNWTPGE 


631-650 


90 


E2-33 


ACNWTPGERCDLEDRDRSEL 


643-662 


91 


E2-35- 


EDRDRSBJSPLLLTTTQWQV 


655-674 


92 
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Table 9. Profile of adluvated E1 in Balfa/c mice 





alum 


T-ceil adjuvant 


RBI 


antibody titre (mean ± SD, n=5) 


. 96000 ± 101000 


62000 i 60000 


176000+ 149000 


antibody isotypes 


IgGI 


lgG1/2b 


IgG1/2a 


T-cell proliferation in spleen 1 (n=3) 


11750(28) 


48300(3/3) 


26000 (3/3) 


T-ce)l proliferation in lymph node 2 


no specific stimulation 


4000 


8000 


cytokine profile (spleen) 


0-4 


lFN-g/II-4 


IFN-g/lW 



1 after three s.c7i.m. immunizations, 3 randomfy selected mice were analyzed individually, the result is 
5 expressed as the mean specific cpm obtained after 4 days of E1 stimulation (1 u g/mi), the number in 
brackets refecs to the number of mice with specific stimulation above background 



10 



2 after one single intra footpath immunization (n=2), the result is expressed as the mean specific cpm 
obtained after 5 days of El stimulation (1 ug/ml) 
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Table 10. Humoral Responses: No. of Immunizations required for different E-1 antibodies levels 



Animal 


status 


seroconversion 1 


>1U/ml 2 


>iau/mi 


Marcel 


chronic 


3 


4 


5 


Peggy 


chronic 


3 


5 


>12 


Femma 


chronic 


4 


5 


>12 


Yoran 


naive 


3 


4 


5 


Marti 


naive 


2 


3 


5 



1 defined as ELISA signal higher than cut-off level if no El -antibodies were present prior to immunization, in 
the other cases the observation of a titer higher than the 3 individual time points of pre-immunization titers 
was considered as the point of seroconversion. 

10 2 the unit is defined as follows: the level of E1 antibodies in human chronic carriers, prior to interferon therapy 
and infected with genotype 1 b is < 0.1 U/ml for 50% of the patients t between 0.1 to 1 U/ml for 25% of the 
patients and > 1 U/ml in the remaining 25% of patients, n=58 
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Example 12: Immunization of a chimpanzee chronically infected.with HCV subtype lb 

5 A chimpanzee (Phil) already infected for over 13 years (5015 days before irnmuruzation) with an 
HCV subtype lb strain was vaccinated with Ei (aa 192-326) which was derived from a different 
strain of genotype lb, with a 95.1% identity on the amino acid level (see also Table 2 of WO 
99/67285 the whole of which is incorporated herein by reference), and which was prepared as 
described in examples 1-3* of WO 99/97285. The chimpanzee received in total 6 intramuscular 

10 immunizations of each 50 ug El in PBS/0.05% CHAPS mixed with RIBI R-730 
(MPLA+TDM+CWS) according to the manufacturer's protocol (Ribi Inc. Hamilton^ MT). The 6 
immunizations were given in two series of three shots with a three week interval- and with a lag 
period of 6 weeks between the two series. Starting 150 days prior to immunization, during the 
immunization period and until 1 year post irnmunization (but see below and WO 99/67285) the 

15 chimpanzee was continuously monitored for various parameters indicative for the activity of the 
HCV induced disease. These parameters included blood chemistry, ALT, AST, gammaGT. blood 
chemistry, viral load in the serum, viral load in the liver and liver histology. In addition, the immune 
answer to the immunization was monitored both on the humoral and cellular level. During this period 
the animal was also monitored for any adverse effects of the inununization, such as change in 

20 behaviour, clinical symptoms, body weight, temperature and local reactions (redness, swelling, 
indurations). Such effects were not detected. 

Clearly, ALT (and especially gammaGT, data not shown) levels decreased as soon as the antibody 
level against El reached its maximum (see, Figure 8 of WO 99/67285). ALT rebounded rather 
25 rapidly as soon as the antibody levels started to decline, but gammaGT remained at a lower level as 
long as anti-El remained detectable. 

E2 antigen in the liver decreased to almost undetectable levels during the period in which anti-El 
was detectable and the E2 antigen rebounded shortly after the disappearance of these antibodies. 
30 Together with the Core and E2 antigen becoming undetectable in the liver, the inflammation of the 
liver markedly decreased (see also Table 3 of WO 99/67285). This is a major proof that the vaccine 
induces a reduction of the liver damage, probably by clearing, at least partially, the viral antigens 
from its major target organ, the liver. 
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The viraemia level, as measured by Amplicor HCV Monitor (Roche, Basel, Switzerland), remained 
approximately unchanged in the serum during the whole study period. 

More detailed analyses of the humoral response revealed that the maximum end-point titer reached 
5 14.5 x 10 3 (after the sixth immunization) and that this titer dropped to undetectable I year post 
immunization (Figure 8 of WO 99/67285). Figure 9 of WO 99/67285 shows that the main epitopes, 
which can be mimicked by peptides, recognized by the B-cells are located at the N-teiminal region of 
E2 (peptides VI V2 and V2V3, for details on the peptides used see Table 4 of WO 99/67285). Since 
the reactivity against the recombinant El is higher and longer lasting, it can also be deduced from 

10 this figure, that the antibodies recognizing these' peptides represent only part of the total antibody 
population against El. The remaining part is directed against epitopes which cannot be mimicked by 
peptides, i.e discontinuous epitopes. Such epitopes are only present on the complete El molecule or 
even only on the particle-like structure. Such an immune response against El is unique, at least 
compared to what is normally observed in human chronic HCV carriers (WO 96/13590 to Maertens 

15 et al.) and in chimpanzees (van Doom et al., 1996), who raise anti-El antibodies in their natural 
course of infection. In those patients, anti-El. is in pan also directed to discontinuous epitopes but a 
large proportion is directed against the C4 epitope (±50% of the patient sera), a minor proportion 
against VI V2 (ranging from 2-70% depending on the genotype), and reactivity against V2V3 was 
only exceptionally recorded (Maertens et al., 1997). 

20 

Analysis of the T-cell reactivity indicated that also this compartment of the immune system is 
stimulated by the vaccine in a specific way, as the stimulation index of these T-cells rises from 1 to 
2.5, and remains somewhat elevated during the follow up period (Figure 10 of WO 99/67285). It is 
this T cell reactivity that is only seen in Long term responders to interferon therapy (see: PCT/EP 
2 5 94/03555 to Leroux-Roels et al.; Leroux-Roels et al., 1996). 

Example 13: Immunization of a chronic HCV carrier with different subtype 

30 A chimpanzee (Ton) already infected for over 10 years (3809 days before immunization) with HCV 
from genotype ia was vaccinated with El from genotype lb, with only a 79.3 % identity on the 
amino acid level (see also Table 2 of WO 99/67285), and prepared as described in the previous 
examples. The chimpanzee received a total of 6 intramuscular immunizations of 50 u,g El in 
PBS/0.05% CHAPS each mixed with RIBI R-730 according to the manufacturer's protocol (Ribi 
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Inc. Hamilton, MT). The 6 immunizations were given in two series of three shots with a three week 
interval and with a lag period of .4 weeks between the two series. Starting 250 days prior to 
immunization, during the immunization period and until 9 months (but see below and WO 99/67285) 
. post imm unizat ion the chimpanzee was continuously monitored for various parameters indicative for 
5 the activity of the HCV induced disease. These parameters included blood chemistry, ALT, AST, 
gammaGT, viral load in the serum, viral load in the liver and liver histology. In addition, the immune 
answer to the immunization was monitored both on the humoral and cellular level. During this period 
■ the animal was also monitored for any adverse effects of the immunization, such as change in 
behaviour, clinical symptoms,' body weight, temperature and local reactions (redness, swelling,' 

1 0 indurations). Such effects -were not detected. 

Clearly, ALT levels (and gammaGT levels, data not shown) decreased as soon as the antibody level 
against El reached its maximum (Figure 11 of WO 99/67285). ALT and gammaGT rebounded as 
soon as the antibody levels started to decline, but ALT and gammaGT remained at a lower level 
during the complete follow up period. ALT levels were even significantly reduced after vaccination 

15 (62 ± 6 U/l) as compared to the period before vaccination (85 + 1 1 U/I). Since less markers of tissue 
damage were recovered in the serum, these findings were a first indication that the vaccination 
induced an improvement of the liver disease. 

E2 antigen levels became undetectable in the period in which anti-El remained above a titer of 1.0 x 
20 1 0 3 , but became detectable again at the time of lower El antibody levels. Together with the 
disappearance of HCV antigens, the inflammation of the liver markedly decreased from moderate 
chronic active hepatitis to minimal forms of chronic persistent hepatitis (Table 3 of WO 99/67285). 
This is another major proof that the vaccine induces a reduction of the liver damage, probably by 
clearing, at least partially, the virus from its major target organ, the liver. 

25 

The viraemia level, as measured by Amplicor HCV Monitor (Roche, Basel, Switzerland), in the 
serum remained at approximately similar levels during the whole study period. More detailed 
analysis of the humoral response revealed that the maximum end-point, titer reached was 30 x 10 3 
(after the sixth uiimunization) and that mis titer dropped to 0.5 x 1 0 3 nine months after imrauriization 
30 (Figure 11 of WO 99/67285). Figure 12 of WO 99/67285 shows that the main epitopes, which can 
be mimicked by peptides and are recognized by the B-cells, are located at the N^rminal region 
(peptides V1V2 and V2V3, for details on the peptides used see Table 4 of WO 99/67285). Since the 
reactivity against the recombinant El is higher and longer lasting, it can also be deduced from this 
figure, mat the antibodies recognizing these peptides represent only part of the total antibody 
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population against El. The remaining part is most likely directed against epitopes which cannot be 
mimicked by peptides, i.e. discontinuous epitopes. Such epitopes are probably only present on the 
complete El molecule or even only on the particle-like structure. Such an immune response against 
EI is unique, at least compared to what is normally observed in human chronic HCV carriers, which 
5 have detectable anti-El. In those patients, anti-El is in part also discontinuous, but a large 
proportion is directed against he C4. epitope (50% of the patient sera), a minor proportion against 
V1V2 (ranging from 2-70% depending on the genotype) and exceptionally reactivity against V2V3 
was recorded (Maertens et al., 1997). As this chimpanzee is infected with an la isolate the antibody 
response was also evaluated for cross-reactivity towards a El-la antigen. As can be seen in Figure 
10 13 of WO 99/67285, such cross-reactive antibodies are indeed generated, although, they form only 
part of the total antibody population. Remarkable is the correlation between the reappearance of 
viral antigen in the liver and the disappearance of detectable anti-la El antibodies in the serum. 

Analysis of the T-cell reactivity indicated that also this compartment of the immune system is 
1 5 stimulated by the vaccine in a specific way, as the stimulation index of these T-cells rises from 0.5 to 
5, and remains elevated during the follow up period (Figure 14 of WO 99/67285). 

Example 14: Reboosting of HCV chronic carriers with El 

20 

As the El antibody titers as observed in examples 12 and 13 were not stable and declined over time, 
even to undetectable levels for the lb infected chimp, it was investigated if this antibody response 
could be increased again by additional boosting. Both chimpanzees were immunized again with three 
consecutive intramuscular immunization with a three week interval (50 pg El mixed with RIBI 
25 adjuvant). As can be judged from Figures 8 and 11 of WO 99/67285, the anti-El response could 
indeed be boosted, once again the viral antigen in the liver decreased below detection limit The viral 
load in the serum remained constant although in Ton (Figure 1 1 of WO 99/67285). A viremia level 
of < 10 5 genome equivalents per ml was measured for the first time during the follow up period. 

3 0 Notable is the finding that, as was already the case for the first series of immunizations, the 

chimpanzee infected with the subtype lb HCV strain (Phil) responds with lower anti-El titers, than 
the chimpanzee infected with subtype la HCV strain (maximum titer in the first round 14.5 x 10 3 
versus 30 x iO 3 for Ton and after additional boosting only 1.2 x 10 3 for Phil versus 40 x 10 3 for 
Ton). Although for both animals the beneficial effect seems to be similar, it could be concluded from 
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this experiment that immunization of a chronic carrier with an El protein derived from another 
subtype or genotype may be especially beneficial to reach higher titers, maybe circumventing a 
preexisting and specific immune suppression existing in the host and induced by the infecting 
subtype or genotype. Alternatively, the lower titers observed in the homologous setting (lb vaccine + 
5 lb infection) may indicate binding of file bulk of the antibodies to virus. Therefore, the induced 
antibodies may possess neutralizing, capacity. 

Example 15: Demonstration of prophylactic utility of El-vaccination in chimpanzee 

10 The HCV Els protein (amino acids 192-326) was expressed in Vero cells using recombinant 
vaccinia virus HCY11B. This vaccinia virus is essentially identical to wHCVUA (as described in 
U.S. Patent No. 6,150,134, the entire contents of which is hereby incorporated by reference) but has 
been passaged from RK13 to Vero cells. The protein was purified (by means of lentil 
chromatography, reduction-alkylation and size exclusion chromatography) essentially as described in 

1 5 example 9 of PCT/E99/04342 (WO 99/67285), making use of iodoacetamide as alkylating agent for 
the cysteines. After purification the 3% empigen-BB was exchanged to 3% betain by size exclusion 
chromatography as described in example 1 of PCT/E99/04342 mis process allows to recover Els as 
a particle. Finally the material was desalted to PBS containing 0.5% betain and an Els concentration 
of 500 ug/ml. This El was mixed with an equal volume of Alhydrogel 1.3% (Superfos, Denmark) 

20 and finally further diluted with 8 volumes of 0,9% NaCl to yield alum-adjuvanted El at a 
concentration of 50 ug El/ml and 0.13% of Alhydrogel. 

The HCV E2deitaHVRI (amino acids 412-715) was expressed in and purified from Vero essentially 
as described for El using recombinant vaccinia virus HCV101 which has been recombined from 
25 pvHCV-101 described in Example 8 of PCT/E99/04342 and wild type vaccinia virus. Also 
E2deltaHVRI behaves as a particle (measured by dynamic light scattering) after exchange of 
empigen to betain. 

Five chimpanzees were selected which tested negative for HCV-RNA and HCV-antibodies. One of 
30 the animals (Huub) was not immunized, 2 animals received 6 immunizations with 50 ug El 
adjuvanted with alum (Marti and Yoran) while the remaining 2 animals received 6 immunizations 
with 50 ug E2deltaHVRI adjuvanted with alum (Joost and Karlien). All immunizations were 
administered intra-muscularly with a 3 week interval. Humoral and cellular immune responses were 
assessed in each animal against the antigen with which they where immunized and in each animal 
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both type of responses was detected as shown in Table 11. 

Table 11: antibody titers were determined by ELISA two weeks after the 6 th immunization. A serial 

dilution of the sample was compared to an in house standard (this in house standard defined as 
5 having 1000 mU/rrU of El or anri-E2deltaHVR I antibody is a mixture of three sera from HCV 

chronic carriers selected based on a high anti-envelope titer). The stimulation index, which reflects 

the cellular immune response, was obtained by culturing PBMC, drawn from the animals two weeks 
. after the third immunization, in the presence or absence of envelope antigen and determining the 

amount of tritiated thymidine incorporated in these cells during a pulse of 18 hours after 5 days of 
0 culture. The stimulation index is the ratio of thymidine incorporated in the cells cultured with 

envelope antigen versus the ones cultured without antigen. A stimulation index of >3 is considered a 

positive signal. 





Anti-El response 


Anti-E2deitaHVRI response 




Antibody titer 


Stimulation 
index 


Antibody titer 


Stimulation 
•index 


Yoran 


14110 


10.9 






Marti 


5630 


14.2 






Joost 






3210 


8.5 


Karliea 






1770 


11.2 



1 5 Three weeks after the last of 6 immunizations all animals including the control were challenged with 
100 CID (chimpanzee infectious doses) of a genotype lb inoculum (J4.91, kindly provided by Dr. J. 
Bukh, NTH, Bethesda, Maryland). The amino acid sequence divergence between the vaccine proteins 
and the J4.91 isolate (of which the sequence information is available under accession number 
BAA01583) is 7% (9 out of 135 amino acids) for Els and 11% (32 out of 304 amino acids) for 

20 E2delta HVRI; Consequently this challenge is considered heterologous and reflects a real life 
challenge. 

All chimpanzees became HCV-RNA positive (determined with Monitor HCV, Roche, Basel, 
Switzerland) on day 7 post challenge and a first ALT and garnmaGT peak was measured between 
25 days 35 and 63. This evidences that all chimps developed acute hepatitis. Remarkably, both El 
immunized animals resolved their infection while the E2deltaHVRI and the control animal .did not 
This is evidenced by the fact that the El immunized animals lost HCV-RNA (determined with 
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Monitor HCV, Roche, Basel, Switzerland) at day 98 (Y oran) and 133 (Marti) and remained negative 
so tar until day 273 with monthly testing. All the other animals stayed RNA-positive during the 
entire follow up period of 273 days so far with ALT and gamrnaGT values not reojrning to normal 
as for the El immunized chimpanzees but gradually increasing. 

In conclusion we have shown that El-irnmunization changes the natural history of HCV infection by 
preventing evolution to a chronic infection, which is the major health problem related with HCV. 



10 Example 16: Similar El responses which allowed clearing of infection in chimpanzee can be 
induced in man 

In order to obtain a prophylactic effect of El immunization in man it is required that similar immune 
responses can be induced in man compared to chimpanzee. Therefore we vaccinated 20 male human 

1 5 volunteers, in which no anti-El responses (humoral or cellular) could be detected, with 3 doses of 20 
\ig Els formulated on 0.13% Alhydrogel in 0.5 ml. All immunizations were given intramuscularly 
with a 3 week interval. As evidenced in Table 12, 17 out of 20 volunteers indeed mounted a 
significant humoral and cellular immune response against El and this without serious adverse 
events. Only 1 volunteer (subject 021) should be considered as a non-responder since neither 

20 humoral nor cellular responses were above the cut-off level after 3 El immunizations. The 
observation that the humoral response is lower compared to chimpanzee relates to the fact that only 
3 immunizations with 20 u,g were given and not 6 with 50 ug. 

Table 12; antibody titers were determined by ELISA two weeks after the third immunization. A 
2 5 serial dilution of the sample was compared to an in house standards (this in house standard defined 
as having 1000 mU/ml of El or anti-E2deltaHVR I antibody is a mixture of three sera from HCV 
chronic carriers selected based on a high anti-envelope titer). The stimulation index (cellular immune 
response) was obtained by culturing PBMC, drawn from the individuals two weeks after the third 
iinmunization, in the presence or absence of 1 ug of Els and detennining the amount of tritiated 
30 mymidine incorporated in these cells during a pulse of 18 hours after 5 days of culture. -The 
stimulation index is the ratio of thymidine incorporated in the cells cultured with envelope antigen 
versus the ones cultured without antigen. A stimulation index of >3 is considered a positive signal. 



Subject no 



Antibody titer 



Stimulation index 
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002 


1370 


30.9 


003 


717 . 


132 


004 


800 . 


9.1 


007 


680 


3.8 


008 


1026 


3.9 


009 


325 


4.6 


010 


898 


7.7 


Oil 


284 


4.1 


012 


181 


3.6 


013 


<20 


3.5 


014 


49 


4.6 


015 


228 


3.8 


016 


324 


4.1 


017 


<20* 


6.2 


013 


<20 


6.7 


019 


624 


3.1 


020 


84 


5.5 


021 


<20 


2.1 


022 


226 


2.7 


023 


163 


7.6 



ELISA signal was seen between the preimmune sample and the sample after three irnmunhation, the 
titer however is very low and does not allow accurate determination. 



5 
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Example 17: Boosting of E1 responses in vaccinated healthy volunteers 

19 out of the 20 human volunteers of example 16 were boosted once more with 20 pg E1s formulated on 0.13% 
Aihydrogel in 0.5 ml at week 26 (i.e. 20 weeks after the third immunization). Again antibody titers and cellular 
5 immune responses were determined 2 weeks after this additional immunization. In all individuals the antibody titer 
had decreased during the 20 week interval but could easily be boosted by this additional immunization to a level 
equal or higher of that observed at week 8. On average the antibody titer was double as high after this boost 
compared to the week 8 titer, and 7 times as high compared to the week 26 titer (Table 13). 

1 0 Table 1 3: antibody titers were determined by EUS A two weeks (= week 8) and 20 weeks (= week 26) after the 
third immunization and finally also 2 weeks after the boost (= Week 28). A serial dilution of the sample was 
compared to an in house standards (this in house standard defined as having 1000 mU/ml of E1 antibody, is a 
mixture of three sera from HCV chronic carriers selected based on a high anti-envelope titer). For accurate 
comparison the determination of the titer at week 8 was repeated within the same assay as for the week 26 and 

15 28 samples, which explains the differences with table 1 2 of example 1 6. 



Subject no 


Antibody titer 




Week 8 


Week 26 


Week 28 


002 


1471 


443 


3119 


003 


963 


95 


2355 


004 


1006 


409 


2043 


007 


630 


65 


541 


008 


926 


81 


819 


009 


704 


77 


269 


010 


1296 


657 


3773 


011 


253 


65 


368 


012 


254 


148 


760 


013 


36 


<20 


166 


014 


53 


40 


123 


016 


159 


45 


231 


017 


109 


39 


568 


018 


43 


23- 


50 



WO 02/055548 



PCT/EP02/00219 



019 


425 


157 


1894 


020 


73 


33 


113 


021 


25 


<20 


26 


022 


280 


150 


357 


024 


177 


81 


184 


average 


467 


138 


936 



Remarkably the T-cell responses were for the majority of Individuals still high after the 20 week Interval. Taking in 
account a normalization to the tetanos response, which is present in most individuals as a consequence of 
previous vaccinations, there is no change in the geomeatric mean of the stimulation index. After the additional 
5 boost, taking in account a normalization to the tetanos response, no change is noted (figure 51 ). This confirms 
that a strong T-help response was induced after 3 E1 immunizations and indicates that these immunizations 
induced already a very good T-he)p memory which requires, at leeast for a period of 6 months, no further 
boosting. 

10 Legend to figure 51 : The stimulation index (cellular immune response) was obtained by culturing PBMQ (1 0 s 
cells), drawn from the individuals before immunization (week 0), two weeks after the third immunization (week 8), 
before the booster immunization (week 26) and two weeks after the booster immunization (week 28), in the 
presence or absence of 3 p g of recombinant E1 s or 2 p g tetanos toxoid and determining the amount of tritiated 
thymidine incorporated in these cells during a pulse of 18 hours after 5 days of culture. The stimulation index is 

15 the ratio of thymidine incorporated in the cells cultured with envelope antigen versus the ones cultured without 
antigen. Samples of week 0 and 8 were determined In a first assay (A), while the samples of week 26 and 28 
were determined in a second assay (B) in which the samples of week 0 were reanalyzed. Results are expressed 
as the geometric mean stimulation index of all 20 (A, experiment) or 1 9 (B f experiment) volunteers. 

20 In addition the Th1 cytokine interferon-gamma and Th2 cytokine interieukin-5 were measured in the supernatants 
of the PBMC cultures of samples taken at week 26 and 28 and restlmulated with E1 . As can be judged from 
figure 52 the predominant cytokine secreted by the E1 stimulated PBMC is interferon-gamma. it is highly 
surprising to see that a strong Th1 biased response is observed with an alum adjuvanted El, since alum is known 
to be a Th2 inducer. Once more the* results confirm that a good T-cell memory response is induced, as prior to 

25 the final boost (week 26) already a very strong response Is observed. The interferon-gamma secretion was found 
to be specific as in an additional experiment we saw no difference in interferon-gamma secretion between E1 
stimulated cell cultures and non-stimulated cell cultures of these volunteers using samples drawn at week 0. 
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Legend to figure 52: PBMC (10 s cells), drawn from the individuals before the booster immunization (week 26) and 
two weeks after the booster immunization (week 28), were cultured in the presence of 3 pg of recombinant E1s 
(E1) or 2 tig of tetanos toxoid (TT) or no antigen (Bl). Cytokines were measured in the supernatant taken after 24 

5 hours (interleukin-5) or after 120 hours (interferon-gamma) by means of ELISA. The stimulation index is the ratio 
of cytokine measured in the supernatants of cells cultured with envelope antigen versus the ones cultured without 
antigen. Results are expressed as the geometric mean of pg cytoldna/ml secreted of all 19 volunteers. Samples 
with a cytokine amount below detection limit were assigned the value of the detection limit. Similarly samples with 
extremely high concentrations of cytokine out of the linear range of the assay were assigned the value of the limit 

10 of the linear range of the assay. 

Example 1 8: Fine mapping of cellular response against E1 in vaccinated healthy volunteers. 

In order to map the E1 specific responses a series of 20-mer peptides was synthesized, using standard Fmoc 
15 chemistry, with 8 amino acids overlap and covering the entire sequence of E1s, All peptides were C-terminally 
amidated and N-terminally acetylated, with the exception of IGP 1626 which has a free amino-terminus. 

IGP 1626 YEVRNVSGIYHVTNDCSNSS (amino acid 192-21 1) 

IGP 1627 TNDCSNSSIVYEAADMIMHT (amino acid 204-223) 

20 IGP 1 628 MDMIMHTPGCVPCVRENNS (amino acid 216-235) 

IGP 1629 PCVRENNSSRCWVALTPTLA (amino acid 228-247) 

IGP 1 630 VALTPTLAARNASVPTTTIR (amino acid 240-259) 

IGP 1 631 SVPTTTIRRHVDLLVGAAAF (amino acid 252-271) 

IGP 1 632 LLVGAAAFCSAMYVGDLCGS (amino acid 264-283) 

25 IGP 1633 YVGDLCGSVFLVSQLFTISP (amino acid 276-295) 

■ IGP 1634 SQLFTISPRRHETVQDCNCS (amino acid 288-307) 

IGP 1635 TVQDCNCSIYPGHiTGHRMA (amino acid 300-319) 

IGP 1636 HITGHRMAWDMMMNWSPTTA (amino acid 312-331) 

30 PBMC from 14 different healthy donors not vaccinated with E1s or 10 donors vaccinated with E1s were cultured 
in the presence of 25 pg/ml (non vaccinated persons) or 10 (jg/ml (vaccinated persons, samples taken after the 
third or booster injection) of each peptide separately. As can be judged from figure 53 the peptides IGP 1 627, 
1629, 1630, 1631, 1633, 1635 and 1635 all induced significantly higher responses in vaccinated persons 
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compared to non-vaccinated persons. Using a stimulation index of 3 as cut-off the peptides IGP 1627, 1629, 1 631 
and 1635 were the most frequently recognized (i.e. recognized by at least half of the vaccinated persons tested). 
This experiment proofs that the T-cell responses induced by E1 s derived from mammalian ceil culture are specific 
against El since these responses can not only be recalled by the same Els derived from mammalian cell culture 
5 but also by synthetic peptides. In addition this experiment delineates the most immunogenic T-cefl domains in E1 
are located between amino acids 204-223, 228-271 , 276*295, 300-331 and more particularly even between 
amino acids 204-223, 228-247, 252-271 and 300-319. 

Legend to figure 53: The stimulation index (cellular immune response) was obtained by culturing PBMC (3 x10 5 
10 cells), in the presence or absence of peptides and determining the amount of tritiated thymidine incorporated in 
these cells during a pulse after 5-6 days of culture. The stimulation index is the ratio of thymidine incorporated in 
the cells cultured with peptide versus the ones cultured without peptide. Results are expressed as individual 
values for vaccinated persons (top panel) or non vaccinated or controls (lower pane!). 

15 The present invention also provides therefor, the following E1 peptides, proteins, compisitions and kits 

containing the same, nucleic acid sequences coding for these peptides and proteins containing the same, and 
methods of their manufacture and use, as are generally described herein for other E1 and related peptides of the 



present invention. 




IGP 1626 


spanning positions 192-211 of the E1 region (SEQ ID NO:112), 


IGP 1627 


spanning positions 204-223 of the E1 region (SEQ ID NO;113), 


IGP 1628 


spanning positions 216-235 of the E1 region (SEQ ID NO:114), 


IGP 1629 


spanning positions 228-247 of the E1 region (SEQ ID NO:1 15), 


IGP 1630 


spanning positions 240-259 of the El region (SEQ ID NO;1 1 6), 


IGP 1631 


spanning positions 252-271of the E1 region (SEQ ID NO:1 17), 


IGP 1632 


spanning positions 264-283 of the E1 region (SEQ ID NO:118), 


IGP 1633 


spanning positions 276-295 of the E1 region (SEQ ID NO:1 19), 


IGP 1634 


spanning positions 288-307 of the E1 region (SEQ ID NO:120), 


IGP 1635 


spanning positions 300-319 of the E1 region (SEQ ID NO:121), 


IGP 1636 


spanning positions 312-331 of the E1 region (SEQ ID NO:122). 



30 
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We claim: 

1 . A therapeutic vaccine composition comprising a therapeutic effective amount of: 

a composition comprising at least one purified recombinant HCV single or specific oiigomeric recombinant 
envelope proteins selected from the group consisting of an E1 protein and an E2 protein; and optionally a 
pharmaceuticaliy acceptable adjuvant 

2. A composition according to claim 1 wherein said recombinant HCV envelope proteins are produced by 
recombinant mammalian ceils. 

3. A composition according to claim 1 wherein said recombinant HCV envelope proteins are produced by 
recombinant yeast cells. 

4. A therapeutic vaccine composition comprising a therapeutically effective amount of a composition 
comprising at least one of the following E1 and E2 peptides: 

E1-31 (SEQ ID NO:56) spanning amino acids 181 to 200 of the Core/E1 V1 region, 
E1-33 (SEQ ID NO:57) spanning amino acids 193 to 212 of the E1 region, 
E1 -35 (SEQ ID NO:58) spanning amino acids 205 to 224 of the E1 V2 region (epitope B), 
E1 -35A (SEQ ID NO:59) spanning amino acids 208 to 227 of the E1 V2 region (epitope B), 
1 bE1 (SEQ ID NO:53) spanning amino adds 1 92 to 228 of El regions V1 , C1 , and V2 regions (containing 
• epitope B), 

E1-51 (SEQ ID NO:66) spanning amino acids 301 to 320 of the El region, 

E1 -53 (SEQ ID NO:67) spanning amino acids 31 3 to 332 of the E1 C4 region (epitope A), 

E1 -55 (SEQ ID NO;68) spanning amino acids 325 to 344 of the E1 region, 

Env 67 or E2-67 (SEQ ID NO:72) spanning amino acid positions 397 to 418 of the E2 region (epitope A), 

Env 69 or E2-69 (SEQ ID NO:73) spanning amino acid positions 409 to 428 of the E2 region (epitope A), 

Env 23 or E2-23 (SEQ ID NO;86) spanning positions 583 to 602 of the E2 region (epitope E), 

Env 25 or E2-25 (SEQ ID NO:87) spanning positions 595 to 614 of the E2 region (epitope E), 

Env 27 or E2-27 (SEQ ID NO:88) spanning positions 607 to 626 of the E2 region (epitope E), 

Env 178 or E2-178 (SEQ ID NO:83) spanning positions 547 to 586 of the E2 region (epitope D), 

Env 13B or E2-13B (SEQ ID NO:82) spanning positions 523 to 542 of the E2 region (epitope C), 

1GP 1 626 spanning positions 1 92-21 1 of the El region (SEQ ID NO:1 12), 

IGP 1 627 spanning positions 204-223 of the E1 region (SEQ ID NO:1 13), 

IGP 1628 spanning positions 216-235 of the E1 region (SEQ )D NO:114), 

IGP 1629 spanning positions 228-247 of the E1 region (SEQ ID N0:1 15), 

IGP 1630spanning positions 240-259 of the E1 region (SEQ ID NO:1 16), 
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JGP 1631 spanning positions 252-271 of the E1 region (SEQ ID N0:1 17), 
IGP 1632 spanning positions 264-283 of the EVregion (SEQ ID NO:1 18), * 
IGP 1633 spanning positions 276-295 of the E1 region (SEQ ID NO:1 19), 
IGP 1634 spanning positions 288-307 of the E1 region (SEQ ID NO:120), 
iGP-1635 spanning positions 300-31 9 of the E1 region (SEQ ID NO:1 21 ) and 
IGP 1636 spanning positions 312-331 of the E1 region (SEQ ID NO:122). 

5. A method of treating a mammal infected with HCV comprising administering an effective amount of a 
composition according to any one of claims 1-4 and, optionally, a pharmaceutical^ acceptable adjuvant 

6. The method of claim 5 wherein said mammal is a human. 

7. A composition comprising at least one purified recombinant HCV recombinant envelope proteins selected 
from the group consisting of an E1 protein and an E2 protein, and optionally an adjuvant. 

8. A composition comprising at least one of the following E1 and E2 peptides: 
E1-31 (SEQ ID NO:56) spanning amino acids 181 to 200 of the Core/E1 V1 region, 
E1 -33 (SEQ ID NO:57) spanning amino acids 1 93 to 21 2 of the E1 region, 

E1-35 (SEQ ID NO:58) spanning amino acids 205 to 224 of the E1 V2 region (epitope B), 
• E1-35A (SEQ ID NO:59) spanning amino acids 208 to 227 of the E1 V2 region (epitope B), 
1bE1 (SEQ ID NO:53) spanning amino acids 1 92 to 228 of E1 regions VI, CI, and V2 regions (containing 
epitope B), 

E1-51 (SEQ ID NO;66) spanning amino acids 301 to 320 of the El region, 

E1-53 (SEQ ID NO:67) spanning amino acids 313 to 332 of the E1 C4 region (epitope A), 

E1 -55 (SEQ ID NO:68) spanning amino acids 325 to 344 of the E1 region, 

Env 67 or E2-67 (SEQ ID NO:72) spanning amino acid positions 397 to 41 8 of the E2 region (epitope A), 

Env 69 or E2-69 (SEQ ID NO:73) spanning amino acid positions 409 to 428 of the E2 region (epitope A), 

Env 23 or E2-23 (SEQ ID NO:86) spanning positions 583 to 602 of the E2 region (epitope E), 

Env 25 or E2-25 (SEQ ID NO:87) spanning positions 595 to 614 of the E2 region (epitope E), 

Env 27 or E2-27 (SEQ ID NO:88) spanning positions 607 to 626 of the E2 region (epitope E), 

Env 178 or E2-178 (SEQ ID NO:83) spanning positions 547 to 586 of the E2 region (epitope D), 

Env 1 3B or E2-1 3B (SEQ ID NO:82) spanning positions 523 to 542 of the E2 region (epitope C), 

IGP 1626 spanning positions 192-211 of the E1 region (SEQ ID N0:112), 

IGP 1 627 spanning positions 204-223 of the E1 region (SEQ ID NO:1 1 3), 

IGP 1628 spanning positions 216-235 of the E1 region (SEQ ID NO:1 14), 

IGP 1629 spanning positions 228-247 of the E1 region (SEQ ID NO:1 15), 

IGP 1 630 spanning positions 240-259 of the E1 region (SEQ IDNOM1 6), 
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IGP 1631 spanning positions 252-271of the E1 region (SEQ ID N0:117), 
IGP 1 632spanning positions 264-283 of the E1 region (SEQ ID N0:118), 
IGP 1633spanning positions 276-295 of the E1 region (SEQ ID NCX119), 
IGP 1 634 spanning positions 288-307 of the E1 region (SEQ ID NOM20), 
IGP 1635 spanning positions 300-319 of the E1 region (SEQ ID N0:12t) and 
IGP 1636spanning positions 312-331 of the E1 region (SEQ ID NO:122). 

9. A therapeutic composition for inducing HCV-specific antibodies comprising a therapeutic effective amount 
of a composition comprising an E1/E2 complex formed from purified recombinant HCV single or specific oligomerie 
recombinant El or E2 proteins; and optionally a pharmaceuiically acceptable adjuvant 

10. A composition according to claim 9 wherein safe! recombinant HCV envelope proteins are produced by 
recombinant mammalian cells. 

1 1. A composition according to claim 9 wherein said recombinant HCV envelope proteins are produced by 
recombinant yeast cells. 

1 2. A method of treating a mammal infected with HCV comprising administering an effective amount of a 
composition according to any one of claims 9-1 1 and, optionally, a pharmaceuiically acceptable adjuvant 

13. The method of claim 12 wherein said mammal is a human. 

14. A therapeutic composition for inducing HCV-specific antibodies comprising a therapeutic effective 
amount of a composition comprising at least one purified recombinant HCV single or specific oligomerie recombinant 
envelope protein selected from the group consisting of an E1 protein and an E2 protein; and optionally a 
pharmaceutical^ acceptable adjuvant 
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Fig. 21 A 

5* GGCATGCAAGCTTAATTAATT3 , (SEQ ID NOD 
3*ACGTCCGTACGTTCGAATTAATTAATCGA5* (SEQ ID NO 94) 



- 5 'CCGGGG AGGCCTGCACGTGATCGAGGGCAG ACACCATC ACCACC ATCACTAATAGT • 
' TAATTAACTGCA 3 1 (SEQ ID NO 2) ^ 
3'CCTCCGGACGTGCACTAGCTCCCGTCTGTGGTAGTGGTGGTAGTGATTATCAATTAATTG 
5' (SEQ ID NO 95) 



SEQ ID NO 3 (HCC19A) 

ATGCCCGGTTGCTCTTTCTC7ATCTTCCTCTTGGCTTTACTGTCCTGTCTGACCATTCCA 
GCTTCCGCTTATGAGGTGCGCAACGTGTCCGGGATGTACCATGTCACGAACGACTGCT 
CCAACTCAAGCATTGTGTATGAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGT 
GCCCTGCGTTCGGGAGAACAACTCTTCCCGCTGCTGGGTAGCGCTCACCCCCACGCTC 
GCAGCTAGGAACGCCAGCGTCCCCACCACGACAATACGACGCCACGTCGATTTGCTCG 
TTGGGGCGGCTGCTCTCTGTTCCGCTATGTACGTGGGGGATCTCT.GCGGATCTGTCTTC 
CTCGTCTCCCAGCTGTTCACCATCTCGCCTCGCCGGCATGAGACGGTGCAGGACTGCA 
ATTGCTCAATCTATCCCGGCCACATAACAGGTCACCGTATGGCTTGGGATATGATGAT 
GAACTGGTCGCCTACAACGGCCCTGGTGGTATCGCAGCTGCTCCGGATCCCACAAGCT 
GTCGTGGACATGGTGGCGGGGGCCCATTGGGGAGTCCTGGCGGGCCTCGCCTACTATT 
CCATGGTGGGGAACTGGGCTAAGGTTTTGATTGTGATGCTACTCTTTGCTCTCTAATAG 

SEQ ID NO 5 (HCCI10AI- 

ATGTTGGGTAAGGTCATCGATACCCTTACATGCGGCTTCGCCGACC7CGTGGGGTACA 
TTCCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCG 
GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCT 
ATCTTCCTCTTGGCTTTGCTGTCCTGTCTGACCGTTCCAGCTTCCGCTTATGAAGTGCG 
CAACGTGTCCGGGATGTACCATGTCACGAACGACTGCTCCAACTCAAGCATTGTGTAT 
GAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAAC 
AACTCTTCCCGCTGCTGGGTAGCGCTCACCCCCACGCTCGCAGCTAGGAACGCCAGCG 
7CCCCACCACGACAATACGACGCCACGTCGATTTGCTCGTTGGGGCGGCTGCTTTCTG 
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Fig. 21 B 

TTCCGCTATGTACGTGGGGGACCTCTGCGGATCTGTCTTCCTCGTCTCCCAGCTGTTCA 
CCATCTCGCCTCGCCGGCATGAGACGGTGCAGGACTGCAATTGCTCAATCTATCCCGG 
CCACATAACGGGTCACCGTATGGCTTGGGATATGATGATGAACTGGTCGCCTACAACG 
GCCCTGGTGGTATCGCAGCTGCTCCGGATCCCACAAGCTGTCGTGGACATGGTGGCGG 
GGGCCCATTGGGGAGTCCTGGCGGGTCTCGCCTACTATTCCATGGTGGGGAACTGGGC 
TAAGGTTTTGATTGTGATGCTACTCTTTGCTCCCTAATAG 

SEQ ID NO 7 (HCCIUA) ' . 

ATGTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCATGGGGTACA 
TTCCGCTCGTCGGCGCCCCCCTAGGGGGTGCTGCCAGAGCCCTGGCGCATGGCGTCCG 
GGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAATTTGCCTGGTTGCTCTTTCTCTA 
TCTTCCTCTTGGCTTTACTGTCCTGTCTGACCATTCCAGCTTCCGCTTATGAGGTGCGC 
AACGTGTCCGGGATGTACCATGTCACGAACGACTGCTCCAACTCAAGCATTGTGTATG 
AGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACA 
ACTCT7CCCGCTGCTGGGTAGCGCTCACCCCCACGCTCGCAGCTAGGAACGCCAGCGT 
CCCCACTACGACAATACGACGCCACGTCGATTTGCTCGTTGGGGCGGCTGCTTTCTGTT 
CCGCTATGTACGTGGGGGATCTCTGCGGATCTGTCTTCCTCGTCTCCCAGCTGTTCACC 
ATCTCGCCTCGCCGGCATGAGACGGTGCAGGACTGCAATTGC7CAATCTATCCCGGCC 
ACATAACAGGTCACCGTATGGCTTGGGATATGATGATGAACTGGTAATAG 

ScQ ID NO 9 (HCCI12A) 

ATGCCCGGTTGCTCTTTCTCTA7CTTCCTCTTGGCCCTGCTGTCCTGTCTGACCATACCA 
GCTTCCGCTTATGAAGTGCGCAACGTGTCCGGGGTGTACCATGTCACGAACGACTGCT 
CCAACTCAAGCATAGTGTATGAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGT 
GCCCTGCGTTCGGGAGGGCAACTCCTCCCGTTGCTGGGTGGCGCTCACTCCCACGCTC 
GCGGCCAGGAACGCCAGCGTCCCCACAACGACAATACGACGCCACGTCGATTTGCTC 
GTTGGGGCTGCTGCTTTCTGTTCCGCTATGTACGTGGGGGATCTCTGCGGATCTGTTTT 
CCTTGTTTCCCAGCTGTTCACCTTCTCACCTCGCCGGCATCAAACAGTACAGGACTGCA 
ACTGCTCAATCTATCCCGGCCATGTATCAGGTCACCGCATGGCTTGGGATATGATGAT 
GAACTGGTCCTAATAG 

SEQ ID NO 11 (HCC113A) 

ATGTCCGGTTGCTCT7TCTCTATCTTCCTCTTGGCCCTGCTGTCCTGTCTGACCATACCA 
GCTTCCGCTTATGAAGTGCGCAACGTGTCCGGGGTGTACCATGTCACGAACGACTGCT 
CCAACTCAAGCATAGTGTATGAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGT 
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Fig. 21 C 

GCCCTGCGTTCGGGAGGGCAACTCCTCCCGTTGCTGGGTGGCGCTCACTCCCACGCTC 
GCGGCCAGGAACGCCAGCGTCCCCACAACGACAATACGACGCCACGTCGATTTGCTC 
GTTGGGGCTGCTGCTTTCTGTTCCGCTATGTACGTGGGGGATCTCTGCGGATCTGTTTT 
CCTTGTTTCCCAGCTGTTCACCTTCTCACCTCGCCGGCATCAAACAGTACAGGACTGCA 
ACTGCTCAATCTATCCCGGCCATGTATCAGGTCACCGCATGGCTTGGGATATGATGAT 
GAACTGGTAATAG 

SEQ ID NO 13 (HCCI17A) 

ATGCTGGGTAAGGCCATCGATACCCTTACGTGCGGCTTCGCCGACCTCGTGGGGTACA 
TTCCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCG 
GGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAATTTGCCTGGTTGCTCTTTCTCTA 
TCTTCCTCTTGGCTTTACTGTCCTGTCTAACCATTCCAGCTTCCGCTTACGAGGTGCGC 
AACGTGTCCGGGATGTACCATGTCACGAACGACTGCTCCAACTCAAGCATTGTGTA7G 
AGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAACA 
ACTCTTCCCGCTGCTGGGTAGCGCTCACCCCCACGCTCGCGGCTAGGAACGCCAGCAT 
CCCCACTACAACAATACGACGCCACGTCGATTTGCTCGTTGGGGCGGCTGCTTTCTGTT 
CCGCTATGTACGTGGGGGATCTCTGCGGATCTGTCTTCCTCGTCTCCCAGCTGTTCACC 
ATCTCGCCTCGCCGGCATGAGACGGTGCAGGACTGCAATTGCTCAATCTATCCCGGCC 
ACATAACGGGTCACCGTATGGCTTGGGATATGATGATGAACTGGTACTAATAG 

SEQ ID NO 15 (HCPrSl) 

ATG CC C GGTTGCTCTTTCTCTATCTT 

SEQ ID NO 16 (HCPr=2). 
ATGTTGGGTAAGGTCATCGATACCCT 

SEQ ID NO 17 |HCPr53) 
CTATTAGGACCAGTTCATCATCATATCCCA 

SEQ ID NO 18 {HCPr54J 
CTATTACCAGTTCATCATCATATCCCA 

SEQ ID NO 19 (HCPr107) 

ATACGACGCCACGTCGATTCCCAGCTGTTCACCATC 
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Fig. 21 D 

S£Q ID NO 20 (HCPMOSI 

GATGGTGAACAGCTGGGAATCGACGTGGCGTCGTAT 
SEQ ID NO 21 (HCC137) 

ATGTTGGGTAAGGTCATCGATACCCTTACATGCGGCTTCGCCGACCTCGTGGGGTACA 
TTCCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCG 
GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCT 
ATCTTCCTCTTGGCTTTGCTGTCCTGTCTGACCGTTCCAGCTTCCGCTTATGAAGTGCG 
CAACGTGTCCGGGATGTACCATGTCACGAACGACTGCTCCAACTCAAGCATTGTGTAT 
GAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAAC 
AACTCTTCCCGCTGC7GGGTAGCGCTCACCCCCACGCTCGCAGCTAGGAACGCCAGCG 
TCCCCACCACGACAATACGACGCCACGTCGATTCCCAGCTGTTCACCATCTCGCCTCG 
CCGGCATGAGACGGTGCAGGACTGCAATTGCTCAATCTATCCCGGCCACATAACGGGT 
CACCGTATGGCTTGGGATATGATGATGAACTGGTCGCCTACAACGGCCCTGGTGGTAT 
CGCAGCTGCTCCGGATCCCACAAGCTGTCGTGGACATGGTGGCGGGGGCCCATTGGGG 
AGTCCTGGCGGGTCTCGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTTTTGATTG 
TGATGCTACTCTTTGCTCCCTAATAG 

SEC ID NO 23 (HCCS38) 

ATGTTGGGTAAGGTCATCGATACCCTTACATGCGGCTTCGCCGACCTCGTGGGGTACA 

TTCCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCG 

GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCT 

ATCTTCCTCTTGGCTTTGCTGTCCTGTCTGACCGTTCCAGCTTCCGCTTATGAAGTGCG 

CAACGTGTCCGGGATGTACCATGTCACGAACGACTGCTCCAACTCAAGCATTGTGTAT 

GAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAAC 

AACTCTTCCCGC7GCTGGGTAGCGCTCACCCCCACGCTCGCAGCTAGGAACGCCAGCG 

TCCCCACCACGACAATACGACGCCACGTCGATTCCCAGCTGTTCACCATCTCGCCTCG 

CCGGCATGAGACGGTGCAGGACTGCAATTGCTCAATCTATCCCGGCCACATAACGGGT 

CACCGTATGGCTTGGGATATGATGATGAACTGGTAA 

TAG 

SEQ ID NO 25 {HCCI39) 

ATGTTGGGTAAGGTCATCGATACCCTTACATGCGGCTTCGCCGACCTCGTGGGGTACA 
TTCCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCG 
GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTGTCT 
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Fig. 21 E 

ATCTTCCTCTTG G CTTTG CTGTC CTGTCTG AC CGTTC C AG CTTC CGCTTATG A A GTGCG 
CAACGTGTCCGGGATGTACCATGTCACGAACGACTGCTCCAACTCAAGCATTGTGTAT 
GAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAAC 
AACTCTTCCCGCTGCTGGGTAGCGCTCACCCCCACGCTCGCAGCTAGGAACGCCAGCG 
TCCCCACCACGACAATACGACGCCACGTCGATTCCCAGCTGTTCACCATCTCGCCTCG 
CCGGCATGAGACGGTGCAGGACTGCAATTGCTCAATCTATCCCGGCCACATAACGGGT 
CACCGTATGGCTTGGGATATGATGATGAACTGGTCGCCTACAACGGCCCTGGTGGTAT 
CGCAGCTGCTCCGGATCCTCTAATAG • 

ScG ID NO 27 (HCC140) 

ATGTTGGGTAAGGTCATCGATACCCTTACATGCGGCTTCGCCGACCTCGTGGGGTACA 

TTCCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCG 

GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCT 

ATCTTCCTCTTGGCTTTGCTGTCCTGTCTGACCGTTCCAGCTTCCGCTTATGAAGTGCG 

CAACGTGTCCGGGATGTACCATG7CACGAACGAC7GCTCCAACTCAAGCATTGTGTAT 

GAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAAC 

AACTCTTCCCGCTGCTGGGTAGCGCTCACCCCCACGCTCGCAGCJAGGAACGCCAGCG ' 

TCCCCACCACGACAATACGACGCCACGTCGATTCCCAGCTGTTCACCATCTCGCCTCG 

CCGGCATGAGACGGTGCAGGACTGCAATTGCTCAATCTATCCCGGCCACATAACGGGT 

CACCGTATGGCTTGGGATATGATGATGAACTGGJCGCCTACAACGGCCCJGGTGGTAT 

CGCAGCTGCTCCGGATCGTGATCGAGGGCAGACACCATCACCACCATCAGTAATAG 

SEC ID NO 29 (HCCI62) 

ATGGGTAAGGTCATCGATACCCTTACGTGCGGATTCGCCGATCTCATGGGGTACATCC 
CGCTCGTCGGCGCTCCCGTAGGAGGCGTCGCAAGAGCCCTTGCGCATGGCGTGAGGGC 
CCTTGAAGACGGGATAAATTTCGCAACAGGGAATTTGCCCGGTrGCTCCTTTTCTATTT 
TCCTTCTCGCTCTGTTCTCTTGCTTAATTCATCCAGCAGCTAGTCTAGAGTGGCGGAAT 
ACGTCTGGCCTCTATGTCCTTACCAACGACTGTTCCAATAGCAGTATTGTGTACGAGGC 
CGATGACGTTATTCTGCACACACCCGGCTGCATACCTTGTGTCCAGGACGGCAATACA 
TCCACGTGCTGGACCCCAGTGACACCTACAGTGGCAGTCAAGTACGTCGGAGCAACCA 
CCGCTTCGATACGCAGTCATGTGGACCTATTAGTGGGCGCGGCCACGATGTGCTCTGC 
GCTCTACGTGGGTGACATGTGTGGGGCTGTCTTCCTCGTGGGACAAGCCTTCACGTTCA 
GACCTCGTCGCCATCAAACGGTCCAGACCTGTAACTGCTCGCTGTACCCAGGCCATCT 

TTCAGGACATCGAATGGCTTGGGATATGATGATGAACTGGTAATAG 
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Fig. 21 F 

SEQ ID NO 31 (HCC!63> 

ATGGGTAAGGTCATCGATACCC7AACGTGCGGATTCGCCGATCTCATGGGGTATATCC 
CGCTCGTAGGCGGCCCCATTGGGGGCGTCGCAAGGGCTCTCGCACACGGTGTGAGGGT 
CC77GAGGACGGGG7AAAC7A7GCAACAGGGAA777ACCCGG77GC7C777C7C7A7C7 
TT A77C77GC7C77C7C7CG7G7C7GACCG77CCGGCC7C7GCAG77CCC7ACCGAAA7G 
CCTCTGGGATTTATCATGTTACCAATGATTGCCCAAACTCTTCCATAGTCTATGAGGCA 
GATAACCTGATCCTACACGCACCTGGTTGCGTGCCTTGTGTCATGACAGGTAATGTGA 
GTAGATGCTGGGTCCAAATTACCCCTACACTGTCAGCCCCGAGCCTCGGAGCAGTCAC. 
GGCTCCTCTTCGGAGAGCCGTTGACTACCTAGCGGGAGGGGCTGCCCtCTGCTCCGCG 
TTATACGTAGGAGACGCGTGTGGGGCACTATTCTTGGTAGGCCAAATGTTCACCTATA 
GGCCTCGCCAGCACGC7ACGGTGCAGAACTGCAACTGT7CCATTTACAGTGGCCATGT 
7ACCGGCCACCGGA7GGCA7GGGA7A7GA7GA7GAAC7GG7AA7AG 

SEQ ID NO 33 (HCPrl09J 
7GGGA7A7GA7GA7GAAC7GG7C 

SEQ ID NO 34 (HCPf72) 

C7A77A7GG7GG7AAKGCCARCARGAGCAGGAG 
SEQ !D NO 35 (HCCL22A) 

7GGGA7A7GA7GATGAACTGG7CGCC7ACAACGGCCC7GG7GG7A7CGCAGC7GC7CC 
GGA7CCCACAAGCTG7CG7GGACA7GG7GGCGGGGGCCCAT7GGGGAG7CC7GGCGG 
GCC7CGCC7AC7A7TCCA7GG7GGGGAAC7GGGC7AAGG777TGG77G7GA7GC7AC7C 
777GCCGGCG7CGACGGGCA7ACCCGCG7G7CAGGAGGGGCAGCAGCC7CCGA7ACCA 
GGGGCC77G7G7CCC7C7T7AGCCCCGGG7CGGC7CAGAAAA7CCAGC7CG7AAACAC 
CAACGGCAG77GGCACATCAACAGGAC7GCCC7GAAC7GCAACGAC7CCC7CCAAAC 
AGGG77C777GCCGCAC7A77C7ACAAACACAAA77CAAC7CG7C7GGA7GCCCAGAG 
CGCT7GGCCAGC7G7CGC7CCA7CGACAAG77CGC7CAGGGG7GGGG7CCCC7CAC77 
ACAC7GAGCC7AACAGC7CGGACCAGAGGCCC7AC7GC7GGCAC7ACGCGCC7CGACC 
G7G7GG7A77G7ACCCGCGTC7CAGG7G7GCGG7CCAG7GTA7TGC7JCACCCCGAGCC 
C7G77G7GG7GGGGACGACCGA7CGGT77GG7G7CCCCACG7A7AAC7GGGGGGCGAA 
CGAC7CGGA7G7GC7GA77C7CAACAACACGCGGCCGCCGCGAGGCAAC7GGTTCGGC 
TG7ACA7GGATGAATGGCAC7GGG7TCACCAAGACGTGTGGGGGCCCCCCG7GCAACA 
TCGGGGGGGCCGGCAACAACACC7TGACC7GCCCCAC7GAC7G7T77CGGAAGCACCC 
CGAGGCCACCTACGCCAGA7GCGG7TC7GGGCCC7GGC7GACACC7AGG7G7ATGGTT 
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Fig. 21 G 

CATTACCCATATAGGCTCTGGCACTACCCCTGCACTGTCAACTTCACCATCTTCAAGGT 
TAGGATGTACGTGGGGGGCGTGGAGCACAGGTTCGAAGCCGCATGCAATTGGACTCG 
AGGAGAGCGTTGTGACTTGGAGGACAGGGATAGATCAGAGCTTAGCCCGCTGCTGCTG 
TCTACAACAGAGTGGCAGATACTGCCCTGTTCCTTCACCACCCTGCCGGCCCTATCCA 
CCGGCCTGATCCACCTCCATCAGAACATCGTGGACGTGCAATACCTGTACGGTGTAGG 
GTCGGCGGTTGTCTCCCTTGTCATCAAATGGGAGTATGTCCTGTTGCTCTTCCTTCTCCT 
GGCAGACGCGCGCATCTGCGCCTGCTTATGGATGATGCTGCTGATAGCTCAAGCTGAG 
GCCGCCTTAGAGAACCTGGTGGTCCTCAATGCGGCGGCCGTGGCCGGGGCGCATGGC 
ACTCTTTCCTTCC7TGTGTTCTTCTGTGCTGCCTGGTACATCAAGGGCAGGCTGGTCCC 
TGGTGCGGCATACGCCTTCTATGGCGTGTGGCCGCTGCTCCTGCTTCTGCTGGCCTTAC 
CACCACGAGCTTATGCCTAGTAA 

SEQID NO 37 (HCCI41J 

GATCCCACAAGCTGTCGTGGACATGGTGGCGGGGGCCCATTGGGGAGTCCTGGCGGG 

CCTCGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTTTTGGTTGTGATGCTACTCT 

TTGCCGGCGTCGACGGGCATACCCGCGTGTCAGGAGGGGCAGCAGCCTCCGATACCA 

GGGGCCTTGTGTCCG7C7TTAGCCCCGGGTGGGCTCAGAAAATCCAGCTCGTAAACAC . 

CAACGGCAGTTGGCACATCAACAGGACTGCCCTGAACTGCAACGACTCCCTCCAAAC 

AGGGTTCTTTGCCGCACTATTCTACAAACACAAATTCAACTCGTCTGGATGCCCAGAG 

CGCTTGGCCAGCTGTCGCTCCATCGACAAGTTCGCTCAGGGGTGGGGTCCCCTCACTT 

ACACTGAGCCTAACAGCTCGGACCAGAGGCCCTACTGCTGGCACTACGCGCCTCGACC 

GTGTGGTATTGTACCCGCGTCTCAGGTGTGCGGTCCAGTGTATTGCTTCACCCCGAGCC 

CTGTTGTGGTGGGGACGACCGATCGGTTTGGTGTCCCCACGTATAACTGGGGGGCGAA 

CGACTCGGATGTGCTGATTCTCAACAACACGCGGCCGCCGCGAGGCAACTGGTTCGGC 

TGTACATGGATGAATGGCACTGGGTTCACCAAGACGTGTGGGGGCCCCCCGTGCAACA 

TCGGGGGGGCCGGCAACAACACCTTGACCTGCCCCACTGACTGTTTTCGGAAGCACCC 

CGAGGCCACCTACGCCAGATGCGGTTCTGGGCCCTGGCTGACACCTAGGTGTATGGTT 

CATTACCCATATAGGCTCTGGCACTACCCCTGCACTGTCAACTTCACCATCTTCAAGGT 

TAGGATGTACGTGGGGGGCGTGGAGCACAGGTTCGAAGCCGCATGCAATTGGACTCG 

AGGAGAGjCGTTGTGACTTGGAGGACAGGGATAGATCAGAGCTTAGCCCGCTGCTGCTG 

TCTACAACAGAGTGGCAGAGTGGCAGAGCTTAATTAATTAG 

SEQ ID NO 39 (HCCI42) 

GATCCCACAAGCTGTCGTGGACATGGTGGCGGGGGCCCATTGGGGAGTCCTGGCGGG 
CCTCGCCTACTATTCCATGGTGGGGAACTGGGCTAAGGTTTTGGTTGTGATGCTACTCT 



WO 02/055548 



PCT/EP02/00219 



28/65 



Fig. 21 H 

TTGCCGGCGTCGACGGGCATACCCGCGTGTCAGGAGGGGCAGCAGCCTCCGATACCA 
GGGGCCTTGTGTCCCTCTTTAGCCCCGGGTCGGCTCAGAAAATCCAGCTCGTAAACAC 
CAACGGCAGTTGGCACATCAACAGGACTGCCCTGAACTGCAACGACTCCCTCCAAAC 
AGGGTTCTTTGCCGCACTATTC7ACAAACACAAATTCAACTCGTCTGGATGCCCAGAG 
CGCTTGGCCAGCTGTCGCTCCATCGACAAGTTCGCTCAGGGGTGGGGTCCCCTCACTT 
ACACTGAGCCTAACAGCTCGGACCAGAGGCCC7ACTGCTGGCACTACGCGCCTCGACC 
GTGTGGTATTGTACCCGCGTCTCAGGTGTGCGGTCCAGTGTATTGCTTCACCCCGAGCC 
CTGTTGTGGTGGGGACGACCGATCGGTTTGGTGTCCCCACGTATAACTGGGGGGCGAA 
CGACTCGGATGTGCTGATTCTCAACAACACGCGGCCGCCGCGAGGCAACTGGTTCGGC 
TGTACATGGATGAATGGCACTGGGTTCACCAAGACGTGTGGGGGCCCCCCGTGCAACA 
TCGGGGGGGCCGGCAACAACACCTTGACCTGCCCCACTGACTGTTTTCGGAAGCACCC 
CGAGGCCACCTACGCCAGATGCGGTTCTGGGCCCTGGCTGACACCTAGGTGTATGGTT 
CATTACCCATATAGGCTCTGGCACTACCCCTGCACTGTCAACTTCACCATCTTCAAGGT 
TAGGATGTACGTGGGGGGCGTGGAGCACAGGTTCGAAGCCGCATGCAATTGGACTCG 
AGGAGAGCGTTGTGAC7TGGAGGACAGGGATAGATCAGAGCTTAGCCCGCTGCTGCTG 
TCTACAACAGGTGATCGAGGGCAGACACCATCACCACCATCACTAATAG ■ 

SEQ ID NO 41 (HCCI43) 

ATGGTGGGGAACTGGGCTAAGGTTTTGGTTGTGAtGCTACTCTTTGCCGGCGTCGACG 
GGCATACCCGCGTGTCAGGAGGGGCAGCAGCCTCCGATACCAGGGGCCTTGTGTCCCT 
CTTTAGCCCCGGGTCGGCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGTTGGCAC 
ATCAACAGGACTGCCCTGAACTGCAACGACTCCCTCCAAACAGGGTTCTTTGCCGCAC 
TATTCTACAAACACAAATTCAACTCGTCTGGATGCCCAGAGCGCTTGGCCAGCTGTCG 
CTeCATCGACAAGTTCGCTCAGGGGTGGGGTCCCCTCACTTACACTGAGCCTAACAGC 
TCGGACCAGAGGCCCTACTGCTGGCACTACGCGCCTCGACCGTGTGGTATTGTACCCG 
CGTCTCAGGTGTGCGGTCCAGTGTATTGCTTCACCCCGAGCCCTGTTGTGGTGGGGAC 
GACCGATCGGTTTGGTGTCCCCACGTATAACTGGGGGGCGAACGACTCGGATGTGCTG 
ATTCTCAACAACACGCGGCCGCCGCGAGGCAACTGGTTCGGCTGTACATGGATGAATG 
GCACTGGGTTCACCAAGACGTGTGGGGGCCCCCCGTGCAACATCGGGGGGGCCGGCA 
ACAACACCTTGACCTGCCCCACTGACTGTTTTCGGAAGCACCCCGAGGCCACCTACGC 
CAGATGCGGTTCTGGGCCCTGGCTGACACCTAGGTGTATGGTTCATTACCCATATAGG 
CTCTGGCACTACCCCTGCACTGTCAACTTCACCATCTTCAAGGTTAGGATGTACGTGGG 
GGGCGTGGAGCACAGGTTCGAAGCCGCATGCAATTGGACTCGAGGAGAGCGTTGTGA 
CTTGGAGGACAGGGATAGATCAGAGCTTAGCCCGCTGCTGCTGTCTACAACAGAGTGG 
CAGAGCTTAATTAATTAG • 
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Fig. 211 

SEQ ID NO 43 (HCC144) 

ATGGTGGGGAACTGGGCTAAGGTTTTGGTTGTGATGCTACTCTTTGCCGGCGTCGACG 
GGCATACCCGCGTGTCAGGAGGGGCAGCAGCCTCCGATACCAGGGGCCTTGTGTCCCT 
CTTTAGCCCCGGGTCGGCTCAGAAAATCCAGCTCGtAAACACCAACGGCAGTTGGCAC 
ATCAACAGGACTGCCCTGAACTGCAACGACTCCCTCCAAACAGGGTTCTTTGCCGCAC 
TATTCTACAAACACAAATTCAACTCGTCTGGATGCCCAGAGCGCTTGGCCAGCTGTCG 
CTCCATCGACAAGTTCGCTCAGGGGTGGGGTCCCCTCACTTACACTGAGCCTAACAGC 
TCGGACCAGAGGCCCTACTGCTGGCACTACGCGCCTCGACCGTGTGGTATT.GTACCCG 
CGTCTCAGGTGTGCGGTCCAGTGTATTGCTTCACCCCGAGCCCTGTTGTGGTGGGGAC 
GACCGATCGGTTTGGTGTCCCCACGTATAACTGGGGGGCGAACGACTCGGATGTGCTG 
ATTCTCAACAACACGCGGCCGCCGCGAGGCAACTGGTTCGGCTGTACATGGATGAATG 
GCACTGGGTTCACCAAGACGTGTGGGGGCCCCCCG7GCAACATCGGGGGGGCCGGCA 
ACAACACCTTGACCTGCCCCACTGACTGTTTTCGGAAGCACCCCGAGGCCACCTACGC 
CAGATGCGGTTCTGGGCCCTGGCTGACACCTAGGTGTATGGTTCATTACCCATATAGG 
CTCTGGCACTACCCCTGCACTGTCAACTTCACCATCTT-CAAGGTTAGGATGTACGTGGG 
GGGCGTGGAGCACAGGTTCGAAGCCGCATGCAATTGGACTCGAGGAGAGCGTTGTGA 
CTTGGAGGACAGGGATAGATCAGAGCTTAGCCCGCTGCTGCTGTCTACAACAGGTGAT 
CGAGGGCAGACACCATCACCACCATCACTAATAG 

SEQ ID NO 45 (HCCL64) 

ATGGTGGCGGGGGCCCATTGGGGAGTCCTGGCGGGCCTCGCCTACTATTCCATGGTGG 
GGAACTGGGCTAAGGTTTTGGTTGTGATGCTACTCT7TGCCGGCGTCGACGGGCATAC 
CCGCGTGTCAGGAGGGGCAGCAGCCTCCGATACCAGGGGCCTTGTGTCCCTCTTTAGC 
CCCGGGTCGGCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGTTGGCACATCAAC 
AGGACTGCCCTGAACTGCAACGACTCCCTCCAAACAGGGTTCTTTGCCGCACTATTCT 
ACAAACACAAATTCAACTCGTCTGGATGCCCAGAGCGCTTGGCCAGCTGTCGCTCCAT 
CGACAAGTTCGCTCAGGGGTGGGGTCCCCTCACTTACACTGAGCCTAACAGCTCGGAC 
CAGAGGCCCTACTGCTGGCACTACGCGCCTCGACGGTGTGGTATTGTACCCGCGTCTC 
AGGTGTGCGGTCCAGTGTATTGCTTCACCCCGAGCCCTGTTGTGGTGGGGACGACCGA 
TCGGTTTGGTGTCCCCACGTATAACTGGGGGGCGAACGACTCGGATGTGCTGATTCTC 
AACAACACGCGGCCGCCGCGAGGCAACTGGTTCGGCTGTACATGGATGAATGGCACT 
GGGTTCACCAAGACGTGTGGGGGCCCCCCGTGCAACATCGGGGGGGCCGGCAACAAC 
ACCTTGACCTGCCCCACTGACTGTTTTCGGAAGCACCCCGAGGCCACCTACGCCAGAT 
GCGGTTCTGGGCCCTGGCTGACACCTAGGTGTATGGTTCATTACCCATATAGGCTCTGG 
CACTACCCCTGCACTGTCAACTTCACCATCTTCAAGGTTAGGATGTACGTGGGGGGCG 
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Fig. 21J 

TGGAGCACAGGTTCGAAGCCGCATGCAATTGGACTCGAGGAGAGCGTTGTGACTTGGA 

GGACAGGGATAGATCAGAGCTTAGCCCGCTGCTGCTGTCTAGAACAGAGTGGCAGATA 

CTGCCCTGTTCCTTCACCACCCTGCCGGCCCTATCCACCGGCCTGATCCACCTCCATCA , 

GAACATCGTGGACGTGCAATACCTGTACGGTGTAGGGTCGGCGGTTGTCTCCCTTGTC 

ATCAAATGGGAGTATGTCCTGTTGCTCTTCCTTCTCCTGGCAGACGCGCGCATCTGCGC 

CTGCTTATGGATGATGCTGCTGATAGCTCAAGCTGAGGCCGCCTTAGAGAACCTGGTG 

GTCCTCAATGCGGCGGCCGTGGCCGGGGCGCATGGCACTCTTTCCTTCCTTGTGTTCTT 

CT.GTGCTGCCTGGTACATCAAGGGCAGGCTGGTCCCTGGTGCGGCATACGCCTTCTAT 

GGCGTGTGGCCGCTGCTCCTGCTTCTGCTGGCCTTACCACCACGAGC7TATGCCTAGTAA 

SEG ID NO 4.7 {HCCI65) 

AATTTGGGTAAGGTCATCGATACCCTTACATGCGGCTTCGCCGACCTCGTGGGGTACA 

TTCCGC7CGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGGCGTCCG 

GGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATTTGCCCGGTTGCTCTTTCTCT 

ATCTTCCTCT7GGCTTTGCTGTCCTGTCTGACCGTTCCAGCTTCCGCTTATGAAGTGCG 

CAACGTGTCCGGGATGTACCATGTCACGAACGACTGCTCCAACTCAAGCATTGTGTAT 

GAGGCAGCGGACATGATCATGCACACCCCCGGGTGCGTGCCCTGCGTTCGGGAGAAC 

AACTCTTCCCGCTGCTGGGTAGCGCTCACCCCCACGCTCGCAGCTAGGAACQCCAGCG 

TCCCCACCACGACAATACGACGCCACGTCGATTTGCTCGTTGGGGCGGCTGCTTTCTG 

TTCCGCTATGTACGTGGGGGACCTCTGCGGATCTGTCTTCCTCGTCTCCCAGCTGTTCA 

CCATCTCGCCTCGCCGGCATGAGACGGTGCAGGACTGCAATTGCTCAATCTATCCCGG 

CCACATAACGGGTCACCGTATGGCTTGGGATATGATGATGAACTGGTCGCCTACAACG 

GCCCTGGTGGTATCGCAGCTGCTCCGGATCCCACAAGCTGfCGTGGACATGGTGGCGG 

GGGCCCATTGGGGAGTCCTGGCGGGCCTCGCCTACTATTCCATGGTGGGGA'ACTGGGC 

TAAGGTTTTGGTTGTGATGCTACTCTTTGCCGGCGTCGACGGGCATACCCGCGTGTCAG 

GAGGGGCAGCAGCCTCCGATACCAGGGGCCTTGTGTCCCTCTTTAGCCCCGGGTCGGC 

TCAGAAAATCCAGCTCGTAAACACCAACGGCAGTTGGCACATCAACAGGACTGCCCT 

GAACTGCAACGACTCCCTCCAAACAGGGTTCTTTGCCGCACTATTCTACAAACACAAA 

TTCAACTCGTCTGGATGCCCAGAGCGCTTGGCCAGCTGTCGCTCCATCGACAAGTTCG • 

CTCAGGGGTGGGGTCCCCTCACTTACACTGAGCCTAACAGCTCGGACCAGAGGCCCTA 

CTGCTGGCACTACGCGCCTCGACCGTGTGGTATTGTACCCG.CGTCTCAGGTGTGCGGT 

CCAGTGTATTGCTTCACCCCGAGCCCTGTTGTGGTGGGGACGACCGATCGGTTTGGTGT 

CCCCACGTATAACTGGGGGGCGAACGACTCGGATGTGCTGATTCTCAACAACACGCGG 

CCGCCGCGAGGCAACTGGTTCGGCTGTACATGGATGAATGGCACTGGGTTCACCAAGA 

CGTGTGGGnGr.CCCCCGTGCAACATCGGGGGGGCCGGCAACAACACCTTGACCTGCC 
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Fig. 21 K 

CCACTGACTGTTTTCGGAAGCACCCCGAGGCCACCTACGCCAGATGCGGTTCTGGGCC 
CTGGCTGACACCTAGGTGTATGGTTCATTACCCATATAGGCTCTGGCACTACCCCTGCA 
CTGTCAACTTCACCATCTTCAAGGT7AGGATGTACGTGGGGGGCGTGGAGCACAGGTT 
CGAAGCCGCATGCAATTGGACTCGAGGAGAGCGTTGTGACTTGGAGGACAGGGATAG 
A'TCAGAGCTTAGCCCGCTGCTGCTGTCTACAACAGAGTGGCAGATACTGCCCTGTTCC 
TTCACCACCCTGCCGGCCCTATCCACCGGCCTGATCCACCTCCATCAGAACATCGTGG 
ACGTGCAATACCTGTACGGTGTAGGGTCGGCGGT7GTCTCCCTTGTCATCAAATGGGA 
GTATGTCCTCTTGCTCTTCCTTCTCCTGGCAGACGCGCGCATCTGCGCCTGCTTATGGA 
TGATGCTGCTGATAGCTCAAGCTGAGGCCGCCTTAGAGAACCTGGTGGTCCTCAATGC 
GGCGGCCGTGGCCGGGGCGCATGGCACTCTTTCCTTCCTTGTGTTCTTCTGTGCTGCCT 
GGTACATCAAGGGCAGGCTGGTCCCTGGTGCGGCATACGCCTTCTATGGCGTGTGGCC 
GCTGCTCCTGCTTCTGCTGGCCTTACCACCACGAGCTTATGCCTAGTAAGCTT 

SEG ID NO 49 (HCCI66) 

ATGAGCACGAATCCTAAACCTCAAAGAAAAACCAAACGTAACACCAACCGCCGCCCA 
CAGGACGTCAAGTTCCCGGGCGGTGGTCAGATCGTTGGTGGAGTTTACOTGTTGCCGC 
GCAGGGGCCCCAGGTTGGG7GTGCGCGCGACTAGGAAGACTTCCGAGCGGTCGCAAC ' 
CTCGTGGGAGGCGACAACCTATCCCCAAGGCTCGCCGACCCGAGGGTAGGGCCTGGG 
CTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCATGGGGTGGGCAGGATG 
GCTCCTGTCACCCCGCGGCTCTCGGCCTAGTTGGGGCCCTACAGACCCCCGGCGTAGG 
TCGCGTAATTTGGGTAAGGTCATCGATACCCTTACATGCGGCTTCGCCGACCTCGTGG 
GGTACATTCCGCTCGTCGGCGCCCCCCTAGGGGGCGCTGCCAGGGCCCTGGCGCATGG 
CGTCCGGGTTCTGGAGGACGGCGTGAACTATGCAACAGGGAATT7GCCCGGTTGCTCT 
TTCTCTATCTTCCJCTTGGCTTTGCTGTCCTGTCTGACCGTTCCAGCTTCCGCTTATGAA 

gtgcgcaacgtgtccgggatgtaccatgtcacgaacgactgctccaactcaagcattg 
tgtatgaggcagcggacatgatcatgcacacccccgggtgcgtgccctgcgttcggga 
gaacaactcttcccgctgctgggtagcgctcacccccacgctcgcagctaggaacgcc 
agcgtccccaccacgacaatacgacgccacgtcgatttgctcgttggggcggctgctt 
tctgttccgctatgtacgtgggggacctctgcggatctgtcttcctcgtctcccagctg 

TTCACCATCTCGCCTCGCCGGCATGAGACGGTGCAGGACTGCAATTGCTCAATCTATC 
CCGGCCACATAACGGGTCACCGTATGGCTTGGGATATGATGATGAACTGGTCGCCTAC 
AACGGCCCTGGTGGTATCGCAGCTGCTCCGGATCCCACAAGCTGTCGTGGACATGGTG 
GCGGGGGCCCATTGGGGAGTCCTGGCGGGCCTCGCCTACTATTCCATGGTGGGGAACT 
GGGCTAAGGTTTTGGTTGTGATGCTACTCTTTGCCGGCGTCGACGGGCATACCCGCGT 
GTCAGGAGGGGCAGCAGCCTCCGATACCAGGGGCCTTGTGTCCCTCTTTAGCCCCGGG 
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Fig. 21 L 

TCGGCTCAGAAAATCCAGCTCGTAAACACCAACGGCAGTTGGCACATCAACAGGACT 
GCCCTGAACTGCAACGACTCCCTCCAAACAGGGTTCTTTGCCGCACTATTCTACAAAC 
ACAAATTCAACTCGTCTGGATGCCCAGAGCGCTTGGCCAGCTGTCGCTCCATCGACAA 
GTTCGCTCAGGGGTGGGGTCCCCTCACTTACACTGAGCCTAACAGCTCGGACCAGAGG 
CCCTACTGCTGGCACTACGCGCCTCGACCGTGTGGTATTGTACCCGCGTCTCAGGTGT 
GCGGTCCAGTGTATTGCTTCACCCCGAGCCCTGTTGTGGTGGGGACGACCGATCGGTT 
TGGTGTCCCCACGTATAACTGGGGGGCGAACGACTCGGATGTGCTGATTCTCAACAAC 
ACGCGGCCGCCGCGAGGCAACTGGTTCGGCTGTACATGGATGAATGGCACTGGGTTCA 
CCAAGACGTGTGGGGGCCCCCCGTGCAACATCGGGGGGGCCGGCAACAACACCTTGA 
CCTGCCCCACTGACTGTT7TCGGAAGCACCCCGAGGCCACCTACGCCAGATGCGGTTC 
TGGGCCCTGGCTGACACCTAGGTGTATGGTTCATTACCCATATAGGCTCTGGCACTAC 
CCCTGCACTGTCAACTTCACCATCTTCAAGGTTAGGATGTACGTGGGGGGCGTGGAGC 
ACAGGTTCGAAGCCGCATGCAATTGGACTCGAGGAGAGCGTTGTGACTTGGAGGACA 
GGGATAGATCAGAGCTTAGCCCGCTGCTGCTGTCTACAACAGAGTGGCAGATACTGCC 
CTGTTCCTTCACCACCCTGCCGGCCCTATCCACCGGCCTGATCCACCTCCATCAGAAC 
ATCGTGGACGTGCAATACCTGTACGGTGTAGGGTCGGCGGTTGTCTCCCTTGTCATCA 
AATGGGAGTATGTCCTGTTGCTCTTCCTTCTCCTGGCAGACGCGCGCATCTGCGCCTGC 
TTATGGATGATGCTGCTGATAGCTCAAGCTGAGGCCGCCTTAGAGAACCTGGTGGTCC 
TCAATGCGGCGGCCG7GGCCGGGGCGCATGGCACTCTTTCCTTCCTTGTGTTCTTCTGT 
GCTGCCTGGTACATCAAGGGCAGGCTGGTCCCTGGTGCGGCATACGCCTTCTATGGCG 
TGTGGCCGCTGCTCCTGCTTCTGCTGGCCTTACCACCACGAGCTTATGCCTAGTAA 
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Figure 24 
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Fig. 38 

Relative Map Positions of 
anti-E2 monoclonal antibodies 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Ma er tens, Geerc 
Bosman , Pons 
De Martynoff, Guy 
Buyse, Marie -Ange 

(ii) TITLE OF INVENTION: Purified hepatitis C virus envelope 
proteins for diagnostic and therapeutic use. 



(iii) NUMBER OF SEQUENCES: 111 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE 

(B) STREET: 1100 North Glebe Road, 8th Floor 

(C) CITY: Arlington. VA 22201 

(E) COUNTRY: USA 

(F) ZIP: 22201 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(3) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE : Pa ten tin Release £1.0, Version #1.25 

(vi) CURRENT APPLICATION DA.TA: 

(A) APPLICATION NUMBER: Unas signed 

(B) FILING DATE: 01-DEC-2 000 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Sadof f, B. J. 

(B) REGISTRATION NUMBER: 36663 

(C) REFERENCE /DOCKET NUMBER: 2551-53 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 816-4000 

(B) TELEFAX: (703) 816-4100 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGCATGCAAG CTTAATTAAT T 21 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CCGGGGAGGC CTGCACGTGA TCGAGGGCAG ACACCATCAC CACCATCACT AATAGTTAAT 60 
TAACTGCA 68 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) .SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) ' TOPOLOGY: linear 

(ii) MOLECULE TYPE : CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

( B) LOCATION: 1. .539 

(ix) FEATURE: 

(A) NAME/KEY: mac_peptide 

(B) LOCATION: 1. .636 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA CTG TCC TGT 48 

Met Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 
1 5 10 " 15 

CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG TCC GGG ATG- 96 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met 
20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG TAT GAG GCA 144 

Tyr His Val Thr Asn As? Cys Ser Asn Ser Ser lie Val Tyr Glu Ala 
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35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 

50 55 60 

AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG CTC GCA GCT 240 

Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 

65 70 75 80 

AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 

85 90 95 

CTC GTT GGG GCG GCT GCT CTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336 

Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu 

100 105 ~ 110 

TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC TCG CCT CGC 384 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He Ser Pro Arg 

115 120 125 

CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC . 432 

Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 

130 135 140 

ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT 480 

He Thr Gly Kis Arg Met Ala Trp Asp Met Met Met Asn Tro Ser Pro 

145 150 155 150 

ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC 528 

Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val 

165 170 175 

GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC 576 

Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala 

180 185 190 

TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA 624 

Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He Val Met Leu 

195 200 205 

CTC TTT GCT CTC TAATAG 642 
Leu" Phe Ala Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 4: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino' acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met 
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20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala 
35 40 45 

Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu ?he Thr He Ser Pro Arg 
115 120 ' 125 

Arg His Glu Thr Val Gin Aso Cys Asn Cys Ser lie Tyr Pro Gly His 
130 135 140 

He Thr Gly His Arg Met Ala Tro Aso Met Met Met Asn Trp Ser Pro 
145 150 155 160 

Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val 
165 170 175 

Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala 
180 185 190 

Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu lie Val Met Leu 
195 200 205 

Leu Phe Ala Leu 
210 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE : NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..792 

(ix) FEATURE: 

(A) NAME /KEY: mat_oeptide 

(B) LOCATION: 1..789 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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TGC 


TCA 


ATC 


TAT 


576 


Ser 


Pro Arg 


Arg 


His 


Glu 


Thr 


Val 


Glh 


Asp 


Cys 


Asn 


Cys 


Ser 


lie 


•Tyr 








180 










185 










190 








CCC 


GGC CAC 


ATA 


ACG 


GGT 


CAC 


CGT 


ATG 


GCT 


TGG 


GAT 


ATG 


ATG 


ATG 


AAC 


624 


Pro 


Gly His 


He 


Thr 


Gly 


His 


Arg 


Met 


Ala 


Trp 


Asp 


Met 


Met 


Met 


Asn 






195 










200 










205 










TGG 


TCG CCT 


ACA 


ACG 


GCC 


CTG 


GTG 


GTA 


TCG 


CAG 


CTG 


CTC 


CGG 


ATC 


CCA 


672 


Trp 


Ser Pro 


Thr 


Thr 


Ala 


Leu 


Val 


Val 


Ser 


Gin 


Leu 


Leu 


Arg 


He 


Pro 






210 








215 










220 












CAA 


GCT GTC 


GTG 


GAC 


ATG 


GTG 


GCG 


GGG 


GCC 


CAT 


TGG 


GGA 


GTC 


CTG 


GCG 


720 


Gin 


Ala Val 


Val 


Asp 


Met 


Val 


Ala 


Gly 


Ala 


His 


Trp Gly 


Val 


Leu 


Ala 




225 








230 










235 










240 




GGT 


CTC GCC 


TAC 


TAT 


TCC 


ATG 


GTG 


GGG 


AAC 


TGG 


GCT 


AAG 


GTT 


TTG 


ATT 


768 
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Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu lie 
245 250 255 

GTG ATG CTA CTC TTT GCT CCC TAATAG 795 
Val Met Leu Leu Phe Ala Pro 
260 



(2) INFORMATION FOR SEQ ID MO: 6: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 263 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu GIu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn As? Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cvs Val Pro Cvs 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cvs Ser He Tyr 
180 185 190 

Pro Gly His He Thr Gly His Arg Met Ala Trp Aso Met Met Met Asn 
195 200 205 

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 

Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
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225 230 235 240 

Gly Leu Ala Tyr Tyr Ser Mec Val Gly Asn Tro Ala Lys Val Leu He 
245 250 255 

Val Met Leu Leu Phe Ala Pro 
260 



(2) INFORMATION FOR SEQ ID MO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..630 

(ix) FEATURE: 

(A) NAME / KEY : mat_pepcide 

(B) LOCATION: 1..627 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

ATG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGT GCT GCC AGA 96 
Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asd Gly Val Asn Tyr Ala 
35 40 45 

AC A GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 , 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 3 36 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 
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GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACT ACG AC A ATA CGA CGC CAC 432 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Mec Tyr Val 
145 150 155 160 

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576 

Ser Pro Arg Arg His Glu Thr Val Gin As? Cys Asn Cys . Ser lie Tyr 
180 185 190 

CCC GGC CAC ATA AC A GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 

Pro Gly His lie Thr Gly His Arg Mec Ala Trp Asp Met Met Mec Asn 

195 200 205 

TGG TAATAG 633 
Trp 

210 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 209 amino acids 
{ B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly "Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 
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Val Arg Glu Asn Asn Ser Ser Arg Cys Tra Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

Trp 



C2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .480 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1 . . 477 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 48 
Met Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 
1 5 10 15 

CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 96 
Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144 
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 
Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 
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GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 

Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His Val Asp Leu 
85 90 95 

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 

100 105 110 

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 • 120 125 

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His 
130 135 140 

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCC TAATAG 483 

Val Ser Gly His Arg Met Ala Tro Asp Met Met Met Asn Trp Ser 
145 150 155 160 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala 
35 40 45 

Ala Asp Met lie Mec His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His Val Aso Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His 
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130 



135 



140 



Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Tip Ser 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 
{B) LOCATION: 1. .477 

(ix) FEATURE: 

(A) NAME /KEY : mac .peptide 

(B) LOCATION: 1. .474 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG TCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 48 

Met Ser Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 

1.5 10 15 

CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 96 

Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 

20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 144 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr Glu Ala 

35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 

50 55 60 

GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC A.CG CTC GCG GCC 240 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 

65 70 75 80 

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 

85 90 95. 

CTC GTT GGG GCT GCT GCT TTC • TGT TCC GCT ATG TAC GTG GGG GAT CTC 3 36 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 

100 105 110 

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
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115 * 120 125 

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432 
Arg His Gin Thr Val Gin Asd Cys Asn Cys Ser lie Tyr Pro Gly His 
130 135 140 

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 480 
Val Ser Gly His Arg Met Ala Tip Asp Met Met Met Asn Trp 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear ' 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

Tyr His Val Thr Asn Aso Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Vai Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 6 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..633 

(ix) FEATURE: 

(A) NAME/ KEY: mat_peptide 

(B) LOCATION: 1..630 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



ATG 


CTG 


GGT 


AAG 


GCC 


ATC 


GAT 


ACC 


CTT 


ACG 


TGC 


GGC TTC 


GCC 


GAC 


CTC 


48 


Met 


Leu Gly 


Lys 


Ala 


He 


Asp 


Thr 


Leu 


Thr 


Cys 


Gly Phe 


Ala 


Asp 


Leu 




1 








5 










10 








15 






GTG 


GGG 


TAC 


ATT 


CCG 


CTC 


GTC 


GGC 


GCC 


CCC 


CTA 


GGG GGC 


GCT 


GCC 


AGG 


96 


Val 


Gly Tyr 


He 


Pro 


Leu 


Val 


Gly 


Ala 


Pro 


Leu 


Gly Gly 


Ala 


Ala 


Arg 










20 










25 








30 








GCC 


CTG 


GCG 


CAT 


GGC 


GTC 


CGG 


GTT 


CTG 


GAA 


GAC 


GGC GTG 


AAC 


TAT 


GCA 


144 


Ala 


Leu 


Ala 


His 


Gly 


Val 


Arg 


Val 


Leu 


Glu 


ASp 


Gly Val 


Asn 


Tyr 


Ala 








35 










40 








45 










ACA 


GGG 


AAT 


TTG 


CCT 


GGT 


TGC 


TCT 


TTC 


TCT 


ATC 


TTC CTC 


TTG 


GCT 


TTA 


. 192 


Thr 


Gly Asn 


Leu 


Pro 


Gly 


Cys 


Ser 


?he 


Ser 


He 


Phe Leu 


Leu 


Ala 


Leu 






50 










55 










60 










CTG 


TCC 


TGT 


CTA 


ACC 


ATT 


CCA 


GCT 


TCC 


GCT 


TAC 


GAG GTG 


CGC 


AAC 


GTG 


240 


Leu 


Ser Cys 


Leu 


Thr 


He 


Pro 


Ala 


Ser 


Ala 


Tyr 


Glu Val 


Arg Asn 


Val 




65 










70 










75 








80 




TCC 


GGG 


ATG 


TAC 


CAT 


GTC 


ACG 


AAC 


GAC 


TGC 


TCC 


AAC TCA 


AGC 


ATT 


GTG 


288 


Ser 


Gly Mec 


Tyr 


His 


Val 


Thr 


Asn 


ASp 


Cvs 


Ser 


Asn Ser 


Ser 


He 


Val 












85 










90 








95 






TAT 


GAG 


GCA 


GCG 


GAC 


ATG 


ATC 


ATG 


CAC 


ACC 


CCC 


GGG TGC 


GTG 


CCC 


TGC 


336 


Tyr 


Glu 


Ala 


Ala 


Asp 


Met 


He 


Met 


His 


Thr 


Pro 


Gly Cys 


Val 


Pro 


Cys 










100 










105 








110 








GTT 


CGG 


GAG 


AAC 


AAC 


TCT 


TCC 


CGC 


TGC 


TGG 


GTA 


GCG CTC 


ACC 


CCC 


ACG 


384 


Val 


Arg 


Glu 


Asn 


Asn 


Ser 


Ser 


Arg 


Cys 


Trp 


Val 


Ala Leu 


Thr 


Pro 


Thr 








115 










120 








125 










CTC 


GCG 


GCT 


AGG 


AAC 


GCC 


AGC 


ATC 


CCC 


ACT 


ACA 


ACA ATA 


CGA 


CGC 


CAC 


432 


Leu 


Ala 


Ala 


Arg 


Asn 


Ala 


Ser 


He 


Pro 


Thr 


Thr 


Thr He 


Arg Arg 


His 






130 










135 










140 










GTC 


GAT 


TTG 


CTC 


GTT 


GGG 


GCG 


GCT 


GCT 


TTC 


TGT 


TCC GCT 


ATG 


TAC 


GTG 


480 


Val 


Asp 


Leu 


Leu 


Val 


Gly 


Ala 


Ala 


Ala 


Phe 


Cys 


Ser Ala 


Met 


Tyr 


Val 




145 










150 










155 








160 




GGG 


GAT 


CTC 


TGC 


GGA 


TCT 


GTC 


TTC 


CTC 


GTC 


TCC 


CAG CTG 


TTC 


ACC 


ATC 


528 


Gly 


Asp 


Leu 


Cys 


Gly 


Ser 


Val 


Phe 


Leu 


Val 


Ser 


Gin Leu 


Phe 


Thr 


He 












165 










170 








175 






TCG 


CCT 


CGC 


CGG 


CAT 


GAG 


ACG 


GTG 


CAG 


GAC 


TGC 


AAT TGC 


TCA 


ATC 


TAT 


576 


Ser 


Pro 


Arg 


Arg 


His 


Glu 


Thr 


Val 


Gin 


Asp 


Cys 


Asn Cys 


Ser 


He 


Tyr 
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180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
Pro Gly His lie Thr Gly His Arg Mec Ala Trp Asp Met Met Met Asia 
195 200 205 

TGG TAC TAATAG ' 640 
Trp Tyr 
210 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Leu Gly Lys Ala lie Aso Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Glv Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

Tyr Glu Ala Ala Asp Met lie Mec His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser He Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cvs Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

Pro Gly His He Thr Gly His Arg Mec Ala Trp Asp Met Met Met Asn 
195 200 205 
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Trp Tyr * 
210 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE ■ DESCRIPTION: SEQ ID NO: 15: 

ATGCCCGGTT GCTCTTTCTC TATCTT 26 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGTTGGGTA AGGTCATCGA TACCCT 26 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
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CTATTAGGAC CAGTTCATCA TCATATCCCA 30 
{2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTATTACCAG TTCATCATCA TATCCCA 27 
(2) INFORMATION FOR SEQ ID NO: 19: 

(ij SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ATACGACGCC ACGTCGATTC CCAGCTGTTC ACCATC 3 6 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GATGGTGAAC AGCTGGGAAT CGACGTGGCG TCGTAT 
(2) INFORMATION FOR SEQ ID NO: 21: 



36 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..720 

{ix) FEATURE: 

(A) NAME/ KEY: mat_peptide 

(B) LOCATION: 1..717 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



ATG 


TTG 


GGT 


AAG 


GTC 


ATC 


GAT 


ACC 


CTT 


ACA 


TGC 


GGC 


TTC 


GCC 


GAC 


CTC 


48 


Met Leu Gly 


Lys 


Val 


lie 


Asp 


Thr 


Leu 


Thr 


Cys 


Gly 


Phe 


Ala 


Aso 


Leu 




1 








• 5 










10 










15 






GTG 


GGG 


TAC 


ATT 


CCG 


CTC 


GTC 


GGC 


GCC 


CCC 


CTA 


GGG 


GGC 


GCT 


GCC 


AGG 


95 


Val 


Gly 


Tyr 


lie 


Pro 


Leu 


Val 


Gly 


Ala 


Pro 


Leu 


Gly 


Gly 


Ala 


Ala 


Arg 










20 










25 










30 








GCC 


CTG 


GCG 


CAT 


GGC 


GTC 


CGG 


Qipm 


CTG 


GAG 


GAC 


GGC 


•GTG 


AAC 


TAT 


GCA 


144 


Ala 


Leu 


Ala 


His 


Gly 


Val 


Arg 


Val 


Leu 


Glu 


Asp 


Gly 


Val 


Asn Tyr 


Ala 








35 










40 










45 










ACA 


GGG 


AAT 


TTG 


CCC 


GGT 


TGC 


TCT 


TIC 


TCT 


ATC 


TTC 


CTC 


TTG 


GCT 


TTG 


192 


Thr Gly 


Asn 


Leu 


Pro 


Gly 


Cys 


Ser 


Phe 


Ser 


He 


Phe 


Leu 


Leu 


Ala 


Leu 






50 










55 










60 












CTG 


TCC 


TGT 


CTG 


ACC 


GTT 


CCA 


GCT 


TCC 


GCT 


TAT 


GAA 


GTG 


CGC 


AAC 


GTG 


240 


Leu 


Ser 


Cys 


Leu 


Thr 


Val 


Pro 


Ala 


Ser 


Ala 


Tyr 


Glu 


Val 


Arg 


Asn 


Val 




65 










70 










75 










80 




TCC 


GGG 


ATG. 


TAC 


CAT 


GTC 


ACG 


AAC 


GAC 


TGC 


TCC 


AAC 


TCA 


AGC 


ATT 


GTG 


288 


Ser Gly Mec 


Tyr 


His 


Val 


Thr 


Asn 


Asp 


Cys 


Ser 


Asn 


Ser 


Ser 


He 


Val 












85 










90 










95 






TAT 


GAG 


GCA 


GCG 


GAC 


ATG 


ATC 


ATG 


CAC 


ACC 


CCC 


GGG 


TGC 


GTG 


CCC 


TGC 


336 


Tyr Glu Ala 


Ala 


Asp 


Met 


He 


Mec 


His 


Thr 


Pro 


Gly 


Cys 


Val 


Pro 


Cys 










100 










105 










110 








GTT 


CGG 


GAG 


AAC 


AAC 


TCT 


TCC 


CGC 


TGC 


TGG 


GTA 


GCG 


CTC 


ACC 


CCC 


ACG 


384 


Val 


Arg 


Glu 


Asn 


Asn 


Ser 


Ser 


Arg 


Cys 


Trp" 


Val 


Ala 


Leu 


Thr 


Pro 


Thr 








115 










120 










125 










CTC 


GCA 


GCT 


AGG 


AAC 


GCC 


AGC 


GTC 


CCC 


ACC 


ACG 


ACA 


ATA 


CGA 


CGC 


CAC 


432 


Leu 


Ala 


Ala 


Arg 


Asn 


Ala 


Ser 


Val 


Pro 


Thr 


Thr 


Thr 


He 


Arg 


Arg 


His 






130 










135 










140 












GTC 


GAT 


TCC 


CAG 


CTG 


TTC 


ACC 


ATC 


TCG 


CCT 


CGC 


CGG 


CAT 


GAG 


ACG 


GTG 


480 


Val 


Asp 


Ser 


Gin 


Leu 


Phe 


Thr 


He 


Ser 


Pro 


Arg 


Arg 


His 


Glu 


Thr 


Val 
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145 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 

Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 

165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT AC A ACG GCC CTG GTG 576 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 

180 185 190 

GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG 624 

Val Ser Gin Leu Leu Arg He Pro Gin Ala Val Val Aso Met Val Ala 

195 200 1 • 205 

GGG GCC CAT TGG GGA GTC CTG GCG GGT CTC GCC TAC TAT TCC ATG GTG 672 

Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val 
210 215 220 

GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA CTC TTT GCT CCC TAATAG 723 

Gly Asn Trp Ala Lys Val Leu He Val Met Leu Leu Phe Ala Pro 

225 230 235 240 



(2) INFORMATION* FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asn Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 ■ 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 



Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
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145 150 155 160 

Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

Val Ser Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala 
195 200 205 

Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val 
210 215 220 

Gly Asn Trp Ala Lys Val Leu lie Val Mec Leu Leu Phe Ala Pro 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
{iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .558 

(ix) FEATURE: 

{A) NAME/ KEY: mat_pepcide 

(B) LOCATION: 1. .555 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1^5 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Glv Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
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65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Tyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 
Val Asp Ser Gin Leu ?he Thr lie Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 561 
Met Ala Trp Aso Mec Met Mec Asn Trp 
180 185 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 ■ • . 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 
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Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

Gin Asd Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

Met Ala Trp Asp Met Met Met Asn Trp 
180 185 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO ■ 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..603 

(ix) FEATURE: 

(A) NAME/ KEY: mat_peptide 

(B) LOCATION: 1- .600 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT AC A TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 '75 80 
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TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Mec Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 33 6 

Tyr Glu Ala Ala Asp Met He Mec His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG AC A ATA CGA CGC CAC 43 2 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TCC GAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 
Val Aso Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 " 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
Gin Aso Cys Asn Cys Ser lie Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 575 
Met Ala Trp Asd Met Met Mec Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

GTA TCG CAG CTG CTC CGG ATC CTC TAATAG 606 
Val Ser Gin Leu Leu Arg lie Leu 
195 200 

(2) INFORMATION FOR SEQ ID MO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

Val Glv Tyr He Pro Leu Val Glv Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cvs Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 
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Tyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

Met Ala Trp Asp Met Met Mec Asn Tro Ser Pro Thr Thr Ala Leu Val 
180 185 190 

Val Ser Gin Leu Leu Arg He Leu 
195 200 



(2) INFORMATION FOR SEQ ID MO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 
• (B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDKA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..533 

(ix) FEATURE; 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..630 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Aso Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
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Thr Gly Asn Leu Pro Gly Cys Ser ?he Ser lie Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 " 70 75 80 

. TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Aso Cys Ser Asn Ser Ser He Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG ' 384 

Val Arg Glu Asn Asn Ser Ser Arg Cys Tro Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 
Val Asp Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
Gin Asp Cys Asn Cys Ser He Tvr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 576 
Met Ala Trp Asp Met Mec Met Asn Tr? Ser Pro Thr Thr Ala Leu Val 
180 185 190 

GTA TCG CAG CTG CTC CGG ATC GTG ATC GAG GGC AGA CAC CAT CAC CAC 624 
Val Ser Gin Leu Leu Arg He Vai He Glu Gly Arg His His His His 
195 . 200 205 

CAT CAC TAATAG 63 6 

His His 
210 

(2) INFORMATION FOR SEQ ID NO : 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
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35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

Tyr Glu Ala Ala Asd Mec lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115. 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Vai 
145 150 155 160 

Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His Arg 
165 170 175 

Met Ala Trp Asp Mec Mec Mec Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

Val Ser Gla Leu Leu Arg lie Val lie Glu Gly Arg His His His His 
195 200 205 

His His 
210 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

{B) LOCATION: 1. .627 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1. .624 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 
ATG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGA TTC GCC GAT CTC ATG 48 
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Met Gly Lys Val He Asp Thr Leu Thr Cys Gly ?he Ala Asd Leu Met 
15 10 15 

GGG TAC ATC CCG CTC GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC 96 
Gly Tyr He Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 
20 25 30 

CTT GCG CAT GGC GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA 144 
Leu Ala His Gly Val Arg Ala Leu Glu Asu Gly He Asn Phe Ala Thr 
35 40 45 

GGG AAT TTG CCC GGT TGC TCC TTT TCT ATT TTC CTT CTC GCT CTG TTC 192 
Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu leu Ala Leu Phe 
50 55 60 

TCT TGC TTA ATT CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT 240 
Ser Cys Leu He His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 
65 70 75 80 

GGC CTC TAT GTC CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC 288 
Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr 
85 90 95 

GAG GCC GAT GAC GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC 336 
Glu Ala Asp Asp Val He Leu His Thr Pro Gly Cys He Pro Cys Val 
100 105 110 

CAG GAC GGC AAT ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG 384 
Gin Asp Glv Asn Thr Ser Thr Cys Tro Trrr Pre Val Thr Pro Thr Val 
115 120 12S 

GCA GTC AAG TAC GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG 432 
Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arc Ser His Val 
130 135 140 

GAC CTA TTA GTG GGC GCG GCC ACG ATG TGC TCT GCG CTC TAC GTG GGT 480 
Asp Leu Leu Val Gly Ala Ala Thr Men Cys Ser Ala leu Tyr Val Gly 
145 150 155 160 

GAC ATG TGT GGG GCT GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA 528 
Asp Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg 
165 170 175 

CCT CGT CGC CAT CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA 576 
Pro Arg Arg His Gin Thr Val Gin Thr Cvs Asn Cys Ser Leu Tyr Pro 
180 185 190 

GGC CAT CTT TCA GGA CAT CGA ATG GCT TGG GAT ATG ATG ATG AAC TGG 624 
Gly His Leu Ser Gly His Arg Met Ala Trp Asp Mec Me 3 Mec Asn Trp 
195 200 205 

TAATAG 634 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : 



protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Gly Lys Val He Asp Thr Leu Thr Cysdy Phe Ala Asp Leu Met 
15 10 15 

Gly Tyr lie Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 
20 25 30 

Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly He Asn Phe Ala Thr 
35 40 45 

Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe 
50 55 60 

Ser Cys Leu He His Pro Ala Ala Ser Leu Glu Tro Arg Asn Thr Ser 
65 70 75 80 

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr 
85 90 95 

Glu Ala Asp Asp Val He Leu His Thr Pro Gly Cys He Pro Cys Val 
100 105 HO 

Gin Asp Gly Asn Thr Ser Thr Cys Tro Thr Pro Val Thr Pro Thr Val 
115 120 " 125 

Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 
130 135 140 

Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu -Tyr Val Gly 
145 150 155 160' 

Asp Met Cys Gly Ala Val Phe Leu Val Glv Gin Ala Phe Thr Phe Arg 
165 170 175 

Pro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 
180 185 190 

Gly His Leu Ser Gly His Arg Met Ala Trp Asu Met Met Met Asn Trp 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 
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(A) NAME / KEY : CDS 

(B) LOCATION: 1..627 

(ix) FEATURE: 

(A) NAME/ KEY: mat_peptide 

(B) LOCATION: 1..624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATG GGT AAG GTC ATC GAT ACC CTA ACG TGC GGA TTC GCC GAT CTC ATG 48 

Met Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 

15 10 15 

GGG TAT ATC CCG CTC GTA GGC GGC CCC ATT GGG GGC GTC GCA AGG GCT 96 

Gly Tyr lie Pro Leu Val Gly Gly Pro He Gly Gly Val Ala Arg Ala 

20 25 30 

CTC GCA CAC GGT GTG AGG GTC CTT GAG GAC GGG GTA AAC TAT GCA ACA 144 

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 

35 40 45 

GGG AAT TTA CCC GGT TGC TCT TTC TCT ATC TTT ATT CTT GCT CTT CTC 192 

Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe He Leu Ala Leu Leu 

50 55 60 

TCG TGT CTG ACC GTT CCG GCC TCT GCA GTT CCC TAC CGA AAT GCC TCT 240 

Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 

65 70 75 80 

GGG ATT TAT CAT GTT ACC AAT GAT TGC CCA AAC TCT TCC ATA GTC TAT 288 

Gly He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 

85 90 95 

GAG GCA GAT AAC CTG ATC CTA CAC GCA CCT GGT TGC GTG CCT TGT GTC 336 

Glu Ala Asp Asn Leu He Leu His Ala Pro Gly Cys Val Pro Cys Val 

100 105 110 

ATG ACA GGT AAT GTG AGT AGA TGC TGG GTC CAA ATT ACC CCT ACA CTG 384 

Met Thr Gly Asn Val Ser Arg Cys Trp Val Gin He Thr Pro Thr Leu 

115 120 125 

TCA GCC CCG AGC CTC GGA GCA GTC ACG GCT CCT CTT CGG AGA GCC GTT 432 

Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val 

130 135 140- 

GAC TAC CTA GCG GGA GGG GCT GCC CTC TGC TCC GCG TTA TAC GTA GGA 480 

Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Glv 

145 150 155 160 

GAC GCG TGT GGG GCA CTA TTC TTG GTA GGC CAA ATG TTC ACC TAT AGG 528 

Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gin Met Phe Thr Tyr Arg 

165 170 175 

CCT CGC CAG CAC GCT ACG GTG CAG AAC TGC AAC TGT TCC ATT TAC AGT 57 6 

Pro Arg Gin His Ala Thr Val Gin Asn Cys Asn Cys Ser He Tyr Ser 

180 185 ~ 190 

GGC CAT GTT ACC GGC CAC CGG ATG GCA TGG GAT ATG ATG ATG AAC TGG 624 

Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 

195 200 205 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 208 amino acids 
tB) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
15 10 15 

Gly Tyr lie Pro Leu Val Gly Gly Pro lie Gly Gly Val Ala Arg Ala 
20 25 30 

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 
35 40 45 

Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe He Leu Ala Leu Leu 
50 55 60 

Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 
65 70 75 80 

Gly He Tyr His Val Thr Asn Asp Cvs Pro Asn Ser Ser He Val Tyr 
85 90 95 

Glu Ala Asp Asn Leu He Leu His Ala Pro Gly Cys Val Pro Cys Val 
100 105 110 

Met Thr Gly Asn Val Ser Arg Cys Tro Val Gin He Thr Pro Thr Leu 
115 120 125 

Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val 
130 135 140 

Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly 
145 150 155 ~ 160 

Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gin Met Phe Thr Tyr Arg 
165 170 175 

Pro Arg Gin His Ala Thr Val Gin Asn Cys Asn Cys Ser He Tyr Ser 
180 185 190 

Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
{iii) ANTI-SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SHQ ID NO: 33: 
TGGGATATGA TGATGAACTG GTC * 23 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
. <D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CTATTATGGT GGTAAGCCAC AGAGCAGGAG 30 
(2) INFORMATION FOR SEQ ID NO: 35: 

. (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1473 

(ix) FEATURE: 

(A) NAME/ KEY: mat_peocide 

(B) LOCATION: 1..1470 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG 



48 
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Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser 

15 10 15 

CAG CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC 96 

Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala 

20 25 30 

CAT TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC 144 

His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn 

35 40 45 



TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG 192 

Trp Ala Lys Val Leu Val Val Men Leu Leu Phe Ala Gly Val Asp Gly 
50 55 50 

CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT 240 

His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser As? Thr Arg Gly Leu 
65 70 75 80 

GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC 238 

Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn 
85 90 95 

ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC 33 6 

Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp 

• 100 105 110 

TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC 3 84 

Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tvr Lys His Lys Phe 

115 120 125 

AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC 432 

Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp 
130 135 140 

AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC 480 

Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser 
145 150 155 160 



TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT 528 
Ser Asp Gin. Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly 
165 170 175 

ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG 57 6 

He Val Pro Ala Ser Gin Val Cvs Gly Pro Val Tyr Cys Phe Thr Pro 
180 185 190 

AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT 624 
Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr 
195 200 205 



AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG 672 

Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg 
210 215 220 

CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG 72 0 

Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Mec Asn Gly Thr Gly 
225 230 235 240 

TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC 768 

Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly 

245 250 255 
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AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG 315 
Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu 
260 265 270 

GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT 864 
Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys 
275 280 285 

ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC 912 
Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn 
290 295 300 

TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG 960 
Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg 
305 310 315 320 

TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG 1008 
Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu 
325 330 335 

GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT .ACA ACA GAG 1056 
Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu 
340 345 350 

TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC 1104 
Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr 
355 360 365 

GGC CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC 1152 
Gly Leu He His Leu His Gin Asn lie Val Astd Val Gin Tyr Leu Tvr 
370 375 ' 380 

GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC 1200 
Gly Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val 
385 390 395 400 

CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA 1248 
Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu 
405 410 415 

TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG 129 6 

Trp Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu 
420 425 430 

GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC 1344 
Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 
435 440 445 

TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC 1392 
Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val 
450 455 460 

CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT 1440 
Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Tro Pro Leu Leu Leu Leu 
465 470 475 480 

CTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 147 6 

Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
485 490 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 490 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser 
1 5 10 15 

Gin Leu Leu Arg lie Pro Gin Ala Val Val Asp Met Val Ala Glv Ala 
• 20 25 30 

His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn 
35 40 45 

Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Aso Gly 
50 55 60 

His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu 
65 70 75 80 

Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn 
85 90 95 

Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cvs Asn Asd 
100 105 110 

Ser Leu Gin Thr Gly'fche Phe Ala Ala Leu Phe Tyr Lys His Lvs Phe 

115 120 125 

Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Aso 
130 135 140 

Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu- Pro Asn Ser 
145 150 155 160 

Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly 
165 170 175 

He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro 
180 185 190 

Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr 
195 200 205 

Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg 
210 215 220 

Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly 
225 230 235 240 

Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly 
245 250 255 

Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu 
260 265 270 

Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys 
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275 280 285 

Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn 
290 295 300 

Phe Thr lie Phe Lys Val Arg Mec Tyr Val Gly Gly Val Glu His Arg 
305 310 315 320 

Phe Glu Ala Ala Cys Asn Tro Thr Arg Gly Glu Arg Cys Asp Leu Glu 
325 330 " 335 

Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu 
340 345 350 

Trp Gin lie Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr 
355 360 365 

Gly Leu lie His Leu His Gin Asn lie Val Asp Val Gin Tyr Leu Tyr 
370 375 380 

Gly Val Gly Ser Ala Val Val Ser Leu Val lie Lys Tro Glu Tyr Val 
385 390 395 400 

Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu 
405 410 ~ 415 

Trp Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu 
420 425 430 

Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 
435 440 445 

Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lvs Gly Arg Leu Val 
450 455 460 

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 
465 470 475 480 

Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
485 490 

(2) INFORMATION FOR SEQ ID NO : 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNZSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2. .1018 

(ix) FEATURE: 

(A) NAME/ KEY : mat_pe?cide 
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(B) LOCATION: 2.. 1015 

(Xi) SEQUENCE DESCRIPTION: SZQ ID NO: 37: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 46 
He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
1 5 10 15 



GTC 


CTG 






CTC 


GCC 


TAC 


TAT 


TCC 


ATP 
- - - \j 


GTG 


GGG 


AAC 


TGG 


GCT 


A\G 


94 


Val 


Leu. 


Ala 


pi v 


Leu 


Ala 






Ser 


Met 


Val 


Gly 


Asn 




Ala 


Lys 












20 










25 










30 






GTT 


ttp 




PTP 


a tp 


PTA 


PTP 
bib 


TTT 


ppp 


ppp 


PTP 

lJ A b 


pap 


ppp 


par 


app 


ppp 
bub 


142 


Val 


Leu, 


Val 


Vet J. 




Leu 


Leu 


true 


Sis 


Pi 17 


val 


Asp 


PI V 

br±y 


nia 


•1 £1X 


A_g 










35 










40 










45 








ptp 


i \*A 




ppp 


ppa 


PpA 

VabA 


ppp 


TPP 
Ibb, 


P2T 


2.PP 


2PP 


PPP 


PTT 
b 1 1 


PTP 
bib 


TPP 
1 V_b 


p m p 

V b 


J. J u 


wdi 




pl u 


pi v 


Ala 


nld 


a j. a 


Ser 


Asp 


i i- — 


Arg 


p1 v 


Leu 


vax 


Ser 


Leu 








50 










55 










60 










TTT 




ppp 




TPP 
Ibb 


PPT 
\jb 1 


pip 


AAA 


1TP 
Ai b 


P"* P 
b^br 


PTP 

b. 1 b 


PT3. 
bin 


a ap 
AAb 


1PP 

Abb. 


a ap 

AAb 


ppp 




Phe 




Pro 


P.1 V 




n-La 


PI n 
\j J.I1 


Ly s 


— — e 


PI - 


!_<eu 


vol 


Asn 




Asn 


p *; \r 

«-y 






65 










70 










75 












C\\3 4. 


Ibrtj 


par* 
UAL. 


A rpp 
AIL 


a ap 
AAL 


app 


APT 1 
Ab 1 


ppp 

Ijbb 


< br 


A p 


•■nop 


A IP 

AAb 


pap 

b^Ab 


ibb 


pmp 
b. 1 V- 


p> ^ 


nor 

iOO 




Trp 


HIS 




Asn 


Arg 


TV» v 

mr 


A 1 - 


-icb 


Asn 


Cys 


Asn 


Asp 




r mi 


pi n 




o u 


































r\A_..-\ 


pop 


1 ib. 


TTT 




LrbA 


CTA 


* lb. 






pip 

bAb 




TTC 


AAC 


TCG 


TCT 




Thr 


Gly 


Phe 


Phe 


Ala 


Ala 


Leu 


Phe 


Tyr 


Lvs 


His 


Lys 


Phe 


Asn 


Ser 


Ser 












100 










105 










110 






fir* a 


(tpp 




PAP 


ppp 

bbrb 


mmp 

1 lb 


ppp 


A PP 

Aob. 




ppp 

bob 


mpp 
I\-L 


A rpP 
Alb 


GAC 


AAG 


1 1 V, 


ppm 
bb - 




Gly 


Cys 


Pro 


Glu 


Arg 


Leu 


Ala 


Ser 


Cys 


Arg 


Ser 


He 


Asp 


Lys 


Phe 


Ala 










115 










120 










125 








PaP 
brt.br 


ppp 


rppn 
Ibrbi 


pprp 


ppp 

b,bb, 


pfp 

L. lb 


AC i 


p 
1 Ab, 


Ab 1 


P -N p 


PPT* 
bbl 


asp 
AAb 


app 

Abrb 


TCG 


biAb 


PA P 

bAb 


a "i n 

fi J u 


Gin 


Gly 


Trp 


Gly 


Pro 


Leu 


Thr 


Tyr 


Thr 


Giu 


Pro 


Asn 


Ser 


Ser 


Asp 


Gin 








130 










135 










140 










AGG 


CCC 


TAC 


TGC 


TGG 


CAC 


TAC 


GCG 


CCT 


CGA 


CCG 


TGT 


GGT 


ATT 


GTA 


CCC 


478 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Ala 


Pro 


Arg 


Pro 


Cys 


Gly 


He 


Val 


Pro 






145 










150 










155 












GCG 


TCT 


CAG 


GTG 


TGC 


GGT 


CCA 


GTG 


TAT 


TGC 


TTC 


ACC 


CCG 


AGC 


CCT 


GTT 


526 


Ala 


Ser 


Gin 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 


Pro 


Ser 


Pro 


Val 




160 










165 










170 










175 




GTG 


GTG 


GGG 


ACG 


ACC 


GAT 


CGG 


TTT 


GGT 


GTC 


CCC 


ACG 


TAT 


AAC 


TGG 


GGG 


574 


Val 


Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Phe 


Gly 


Val 


Pro 


Thr 


Tyr 


Asn 


Trp 


Gly 












180 










135 










190 






GCG 


AAC 


GAC 


TCG 


GAT 


GTG 


CTG 


ATT 


CTC 


AAC 


AAC 


ACG 


CGG 


CCG 


CCG 


CGA 


622 


Ala 


Asn 


Asp 


Ser 


Asp 


Val 


Leu 


He 


Leu 


Asn 


Asn 


Thr 


Arg 


Pro 


Pro 


Arc 










195 










200 










205 








GGC 


AAC 


TGG 


TTC 


GGC 


TGT 


ACA 


TGG 


ATG 


AAT 


GGC 


ACT 


GGG 


TTC 


ACC 


AAG 


670 


Gly 


Asn 


Trp 


Phe 


Gly 


Cys 


Thr 


Tro 


Met 


Asn 


Gly 


Thr 


Gly 


Phe 


Thr 


Lys 








210 










215 










220 










ACG 


TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GGG 


GCC 


GGC 


AAC 


AAC 


ACC 


718 


Thr 


Cys 


Gly 


Gly 


Pro 


Pro 


Cys 


Asn 


He 


Gly 


Gly 


Ala 


Gly 


Asn 


Asn 


Thr 
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225 230 235 

TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC 766 
Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr 
240 245 250 255 

GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT 814 
Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His 
260 265 ~ 270 

TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC 862 
Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie 
275 280 285 

TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC 910 
Phe Lys Val Arg Met Tyr Val Gly Gly Val .Glu His Arg Phe Glu Ala 
290 295 " 300 

GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT 953 
Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG AGT 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser 
320 325 330 " 335 

GGC AGA GCT TAATTA 1021 
Gly Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
1 5 10 15 

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Tro Ala Lvs Val 
20 25 30 

Leu Val Val Met Leu Leu Phe Ala Gly Val Asd Gly His Thr Arg Val 
35 40 45 

Ser Gly Gly Ala Ala Ala Ser Aso Thr Arg Gly Leu Val Ser Leu Phe 
50 55 60 

Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser 
65 70 75 80 

Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
85 90 95 

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 
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Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asu Lys Phe Ala Gin 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala 
245 250 255 

Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Aso Arg 
305 310 315 " 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser Gly 
325 . 330 335 

Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 
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(A) NAME/ KEY: CDS 

(B) LOCATION: 2.. 1032 

(ix) FEATURE: 

(A) NAME/ KEY: mat_peptide 

(B) LOCATION: 2.. 1029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CA.T TGG GGA 46 
lie Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
15 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Aso Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 

TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC 238 
Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly 
65 70 75 

AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA 286 
Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin 
80 85 90 * 95 

ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT 334 
Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser 
100 105 110 

GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT 382 
Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala 
115 120 125 

CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG 430 
Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin 
130 135 140 

AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC 478 
Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro 
145 150 155 

GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT 526 
Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 
160 165 170 175 

GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG 574 
Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly 
180 185 190 

GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA 622 
Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg 
195 200 205 
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GGC 


AAC 


TGG 


TTC 


GGC 


TGT 


ACA 


TGG 


ATG 


AAT 


GGC ACT GGG 


TTC 


ACC 


AAG 


670 


Gly 


Asn 


Trp 


Phe 


Gly 


Cys 


Thr 


Tro 


Met 


Asn 


Gly Thr Gly 


Phe Thr 


Lys 








210 










215 






220 










ACG 


TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GGG GCC GGC 


AAC 


AAC 


ACC 


718 


Thr 


Cys 


Gly 


Gly 


Pro 


Pro 


Cvs 


Asn 


lie 


Gly 


Gly Ala Gly 


Asn 


Asn 


Thr 






225 










230 








235 










TTG 


ACC 


TGC 


CCC 


ACT 


GAC 


TGT 


TTT 


CGG 


AAG 


CAC CCC GAG 


GCC 


ACC 


TAC 


766 


Leu 


Thr 


Cvs 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 


His Pro Glu 


Ala 


Thr 


Tvr 




240 










245 










250 






255 




GCC 


AGA 


TGC 


GGT 


TCT 


GGG 


CCC 


TGG 


CTG 


ACA 


CCT AGG TGT 


ATG 


GTT 


CAT 


814 


Ala 


Arg 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 


Pro Arg Cys 


Met 


Val 


His 












260 










265 






270 






TAC 


CCA 


TAT 


AGG 


CTC 


TGG 


CAC 


T.AC 


CCC 


TGC 


ACT GTC AAC 


TTC 


ACC 


ATC 


862 


Tyr 


Pro 


Tvr 


Aro 


Leu 


TrD 


His ' 


T Vr 


Pro 


Cys 


Thr Val Asn 


Phe 


Thr 


lie 










275 










230 






285 








TTC 


AAG 


GTT 


AGG 


ATG 


TAC 


GTG 


GGG 


GGC 


GTG 


GAG CAC AGG 


TTC 


GAA 


GCC 


910 


Phe 


Lvs 


Val 


Arcr 


Met 


Tv" r 


Val 


G' v 


Gly 


Val 


Glu His Arg 


Phe 


Glu 


Ala 








290 










295 






300 










GCA 


TGC 


AAT 


TGG 


ACT 


CGA 


GGA 


GAG 


CGT 


TGT 


GAC TTG GAG 


GAC 


AGG 


GAT 


958 


Ala 


Cys 


Asn 


Trp 


Thr 


Arg 


Gly 


Glu 


Arg 


Cys 


Asp Leu Glu 


Asp Arg 


Asp 






305 










310 








315 










AGA 


TCA 


GAG 


CTT 


AGC 


CCG 


CTG 


CTG 


CTG 


•TCT 


ACA ACA GGT 


GAT 


CGA 


GGG 


1006 


Arg 


Ser 


Glu 


Leu 


Ser 


Pro 


Leu 


Leu 


leu 


Ser 


Thr Thr Glv 


Asp Arg Gly 




320 










325 










330 






335 




CAG 


ACA 


CCA 


TCA 


CCA 


CCA 


TCA 


CTA 


AT AG 










1034 


Gin 


Thr 


Pro 


Ser 


Pro 


Pro 


Ser 


Leu 

















340 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii} MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

lie Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 
20 25 30 

Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 
35 40 45 

Ser Gly Gly Ala Ala Ala Ser Aso Thr Arg Gly Leu Val Ser Leu Phe 
50 55 60 

Ser Pro Gly Ser Ala Gin Lys lie Gin Leu Val Asn Thr Asn Gly Ser 
65 70 75 80 
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Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
85 90 * 95 

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 

145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asd Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala 
245 250 255 

Arg Cys Gly Ser Glv Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305- 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly Gin 
325 330 335 



Thr Pro Ser Pro Pro Ser Leu 
340 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 945 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
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(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..942 

(IX) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1. .939 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT. GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
1 5 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asa 
20 25 30 

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192 
Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
50 55 60 

AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC 240 
Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu ?he Tyr 
65 70 75 80 

AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 288 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 336 
Arg Ser He Asp Lys Phe Ala Gin Gly Tro Gly Pro Leu Thr Tyr Thr 
100 105 110 

GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 384 
Glu Pro Asn Ser Ser Asn Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 " 125 

CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 432 
Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 

TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480 
Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 , 150 155 160 

GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528 
Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 576 
Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Mec 
180 185 190 
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AAT 


GGC 


ACT 


GGG 


TTC 


ACC 


AAG 


ACG 


TGT 


ppp ppp ppp ppp tpp AAP ZkTP 

V3V3\. LLv, L>Lv7 iut rtAV* 


O Z n 


Asn 


Glv 


Thr 




Phe 


Thr 


Lys 


Thr 




uiy uiy iriO rxu ^ya f\an iits 








195 










200 




205 






ppp 


p.pp 




a ap 
AAL. 


a ap 


app 

ALL 


TTP 


app 


tpp ppp afT pap TPT TTT PPP 


cm 


p.1 v 


pi \i 
uiy 


TV 1 =» 
nla 


Pi vr 


Asn 


Asn 


inr 


Leu 


inr 


Cys Pro Thr Asp Cys Phe Arg 






210 










215 






220 






pap 


PPf 


par* 




ACC 


1 AL. 


ppr* 


AuA 


l*bC GGT TCi GGG CCC TGG CTG 


720 




aid 


Pro 


PT 11 


a 1 a 
nla 


inr 


Tyr 


a j. a 


a r-rr 

Arg 


Cys Gly Ser Gly Pro Trp Leu 




225 










230 








235 240 




apa 


PPT 


app 




a TP 
AiLx 


bl I 


p an -1 
LAi 


i AU 


CCA 


lAi AisG CTC TGG CAC rAC CCC 


ICQ 

/bo 




Pro 


Arg 


Cys 


faec 


VaJL 


"IS 


Tyr 


r'-O 


Tyr Arg Leu Trp His Tyr Pro 




























AL. 1 


GTC 


AAC 


TTC 


ACC 


ATC 


TTC 


ArsG 


GTT AGG ATG TAC GTG GGG GGC 


816 


Cys 


Thr 


Val 


Asn 


Phe 


Thr 


He 


Phe 


Lys 


Val Arg Met Tyr Val Gly Gly 










Zov 










zoo 


270 




GTG 


GAG 


CAC 


AGG 


TTC 


GAA 


GCC 


GCA 


TGC 


a ar -tpp apT ppa ppa pap PPT 


O O <x 


Val 


Glu 


His 


Arg 


Phe 


Glu 


Ala 


Ala 


Cys 


Asn Trp Thr Arg Gly Glu Arg 








275 










280 




285 




TGT 


GAC 


TTG 


GAG 


GAC 


AGG 


GAT 


AGA 


TCA 


GAG CCT AGC CCG CTG CTG CTG 


912 


Cys 


Asp 


Leu 


Glu 


Asp 


Arg 


Asp 


Arg 


Ser 


Glu Leu Ser Pro Leu Leu Leu 






290 










295 






300 




TCT 


ACA 


ACA 


GAG 


TGG 


CAG 


AGC 


TTA 




AAT TAS 


945 


Ser 


Thr 


Thr 


Glu 


Trp 


Gin 


Ser 


Leu 


He 


Asn 




305 










310 












(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 42: 







(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: procein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
15 10 15 

Gly Val Asp Gly His Thr Arg Val Ser Gly GLy Ala Ala Ala Ser Asp 
20 25 30 

Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie 
35 40 45 

Gin Leu Val Asn Thr Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu 
50 55 ' 60 

Asn Cys Asn Asd Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 
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Ser 


lie 


Asp 
100 


Lys 


Phe 


Ala 


bin 


Gly 
105 


Trp Gly 


Pro 


Leu 


i nr 

no 


(TSrr 

tyr 


1 liX 


LjIU 


rlO 


Asn 
115 


Ser 


ber 


Asp 


trin 


Arg 
120 


pro 


Tyr Cys 


Trp 


H.1S 

125 


Tyr 


Aia 


Pro 


Arg 


Pro 
130 


Cys 


Gly 


lie 


Val 


Pro 
135 


Hi, 

Ala 


Ser 


Gin Val 


Cys 
140 


Gly 


Pro 


Val 


Tyr 


Cys 
145 


Pne 


Thr 


Pro 


Ser 


Pro 
150 


Val 


Val 


XT-. T 

Val 


Gly Thr 
155 


Thr 


Asp 


Arg 


Phe 


Gly 
160 


Val 


Pro 


Thr 


Tyr 


Asn 
165 


Trp 


Gly 


Ala 


Asn 


Asp Ser 
170 


Asp 


Val 


Leu 


lie 
175 


Leu 


Asn 


Asn 


Thr 


Arg 
180 


Pro 


Pro 


Arg 


Gly 


Asn 
185 


Trp Phe 


Gly 


Cys 


Thr 
190 


Trp 


Met 


Asn 


Gly 


Thr 
195 


Gly 


Phe 


Thr 


Lys 


Thr 
200 


Cys 


Gly Gly 


Pro 


Pro 
205 


Cys 


Asn 


lie 


Gly 


Gly 
210 


Ala 


Gly 


Asn 


Asn 


Thr 
215 


Leu 


Thr 


Cys Pro 


Thr 
220 


Asp 


Cys 


Phe 


Arg 


Lys 
225 


His 


Pro 


Glu 


Ala 


Thr 
230 


Tyr 


Ala 


Arg 


Cys Gly 
235 


Ser 


Gly 


Pro 


Trp 


Leu 
240 


Thr 


Pro 


Arg 


Cys 


Met 
245 


Val 


His 


Tyr 


Pro 


Tyr Arg 
250 


Leu 


Trp 


His 


Tyr 
255 


Pro 


Cys 


Thr 


Val 


Asn 
260 


Phe 


Thr 


lie 


Phe 


Lys 
265 


Val Arg 


Met 


Tyr 


Val 
270 


Gly 


Gly 


Val 
Cys 


Glu 

Asp 
290 


His 
275 
Leu 


Arg 
Glu 


Phe 
Asp 


Glu 
Arg 


Ala 

Asp 
295 


Ala 
280 
Arg 


Cys 
Ser 


Asn Trp 
Glu Leu 


Thr 

Ser 
300 


Arg 
285 
Pro 


Gly 
Leu 


Glu 
Leu 


Arg 
Leu 


Ser 


Thr 


Thr 


Glu 


Trp 


Gin 


Ser 


Leu 


lie 


Asn 













305 310 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 
{C} STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1- .958 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 
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(B) LOCATION: 1. .955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn Trp Ala Lys Val leu Val Val Met Leu Leu Phe Ala 
15 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie 
35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT ~GG CAC ATC AAC AGG ACT GCC CTG 192 
Gin Leu Val Asn Thr Asn Gly Ser Tro His He Asn Arg Thr Ala Leu 
50 55 60 

AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC . 240 
Asn Cys Asn Asp Ser Leu Gin Thr Sly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 288 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

CGC TCC ATC GAC AAG TTC GCT CAG Z-C-C- TGG GGT CCC CTC ACT TAC ACT 335 
Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 i:3 HO 

GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 384 
Glu Pro Asn Ser Ser Asp Gin Arr Pro Tyr Cys Trp His Tyr Ala Pro 
115 122 125 

CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 432 
Arg Pro Cys Gly He Val Pro Ala Ssr Gin Val Cys Gly Pro Val Tyr 
130 135 140 

TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480 
Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528 
Val Pro Thr Tyr Asn Trp Gly Ala Asn Aso Ser Asp Val Leu He Leu 
165 170 175 

AAC AAC ACG CGG CCG CCG CGA -GGC AAC TGG TTC GGC TGT ACA TGG ATG 576 
Asn Asn Thr Arg Pro Pro Arc Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 135 190 

AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 624 
Asn Gly Thr Gly Phe Thr Lys Thr Cvs Gly Gly Pro Pro Cys Asn He 
195 200 205 

GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 672 
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 720 
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Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 7 68 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 816 
Cys Thr Val Asn Phe Thr lie Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 864 
Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AG A TCA GAG CTT AGC CCG.CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

TCT ACA ACA GGT GAT CGA GGG CAG ACA CCA TCA CCA CCA TCA CTA A 958 
Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

TAG 961 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
1 5 10 15 

Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asu 
20 25 30 

Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie 
35 * 40 45 

Gin Leu Val Asn Thr Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu 
50 55 60 

Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

Arg Ser lie Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 HO 

Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 ■ 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
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130 135 140 

Cys ?he Thr Pro Ser Pro Val Val Val Gly Thr Thr Aso Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Aso Ser Aso Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Tro Phe Glv Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn lie 
195 200 205 

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Aso Cys Phe Arg 
210 215 . 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Tro His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Me- Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cvs Asn Tro Thr Arg Gly Glu Arg 
275 230 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Gly Aso Arg Glv Gin Thr Pro Ser Pro Pro Ser Leu 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..1392 

(ix) FEATURE: 

(A) NAME/ KEY: mat_peptide 

(B) LOCATION: 1. .1389 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT 
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Met Val Ala Gly Ala His Tro Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
15 10 15 

TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT 96 
Ser Met Val Gly Asn Tro Ala Lys Val Leu Val Val Met Leu Leu Phe 
20 25 30 

GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC 144 
Ala Gly Val Asp Gly His Thr Arg Val Ser Glv Glv Ala Ala Ala Ser 
35 40 45 

GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT- CAG AAA 192 
Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys 
50 55 .60 

ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC 240 
lie Gin Leu Val Asn Thr Asn Gly Ser Tro His lie Asn Arg Thr Ala 
65 70 75 80 

CTG AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC 2 88 

Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC 33 6 

Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC 3 84 

Cys Arg Ser lie Aso Lvs Phe Ala Gin Gly Tro Gly Pro Leu Thr Tyr 
115 120 125 

ACT GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG 43 2 

Thr Glu Pro Asn Ser Ser Aso Gin Arg Pro Tyr Cvs Trp His Tyr Ala 
130 135 140 

CCT CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG 430 
Pro Arg Pro Cys Gly lie Val Pro Ala Ser Gin Val Cys Gly Pro Val 
145 150 155 160 

TAT TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT 52 8 

Tyr Cys Phe Thr Pro Ser Pro Val Val Val Glv Thr Thr Aso Arg Phe 
165 170 175 

GGT GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT 57 6 

Gly Val Pro Thr'Tvr Asn Tro Gly Ala Asn Aso Ser Asp Vai Leu He 
180 185 " 190 

CTC AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG 624 
Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
195 200 205 

ATG AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC 672 
Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Glv Pro Pro Cvs Asn 
210 " 215 ' 220 

ATC GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT 720 
He Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Aso Cys Phe 
225 230 235 240 

CGG AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG 7 68 

Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
245 250 255 
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CTG ACA CCT AGG TGT ATG GTT CAT TAG CCA TAT AGG CTC TGG CAC TAC 816 
Leu Thr Pro Arg Cys Met Vai Kis Tyr Pro Tyr Arg Leu Trp His Tyr 
260 265 270 

CCC TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG 864 
Pro Cys Thr Val Asn Phe Thr lie ?he Lys Val Arg Met Tyr Val Gly 
275 280 285 

GGC GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG 912 
Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG 960 
Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 " 310 315 320 

CTG TCT ACA ACA GAG TGG CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG 1008 
Leu Ser Thr Thr Glu Trp Gin He leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC 1056 
Pro Ala Leu Ser Thr Gly Leu He His Leu His Gin Asn He Vai Asp 
340 345 350 

GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC 1104 
Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 
355 360 365 

AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC 1152 
Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
-370 "* 375 380 

ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC 1200 
He Cys Ala Cys Leu Trp Mec Met Leu Leu He Ala Gin Ala Glu Ala 
385 390 395 400 

GCC TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG 1248 
Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 
405 410 415 

CAT GGC ACT CTT TCC TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC 1296 
His Gly Thr Leu Ser Phe Leu Vai Phe Phe Cys Ala Ala Trp Tyr He 
420 425 430 

AAG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG ' 13 44 

Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
435 440 . 445 

CCG CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 13 95 
Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
450 455 460 

{2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
15 10 15 

Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 

20 25 30 

Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser 
35 40 45 

Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys 
50 55 60 

lie Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala 
65 70 75 80 

Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

Cys Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Tro His Tyr Ala 
130 135 140 

Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val 
145 150 155 160 

Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Aso Arg Phe 
165 170 175 

Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Aso Val Leu lie 
180 185 * 190 

Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
195 200 205 

Met Asn Gly Thr Gly Phe Thr Lys Thr Cvs Gly Gly Pro Pro Cys Asn 
210 215 220 

He Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 
225 230 235 240 

Arg Lys His Pro Glu Ala Thr Tyr Ala Arcr Cys Gly Ser Gly Pro Trp 
245 250 255 

Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Tro His Tyr 
260 265 270 

Pro Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Mer Tyr Val Gly 
275 280 285 

Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 
290 295 300 

Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
305 310 315 320 
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Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

Pro Ala Leu Ser Thr Gly Leu He His Leu His Gin Asn He Val Asp 
340 345 350 

Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 
355 . 360 365 

Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

He Cys Ala Cys Leu Trp Met Met Leu Leu He Ala Gin Ala Glu Ala 
385 390 395 400 

Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 
405 410 415 

His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Tro Tyr He 
420 425 430 

Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
435 440 445 

Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .2079 

(ix) FEATURE: 

(A) NAME/ KEY: mat_peptide 

(B) LOCATION: 1. .2076 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 
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GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 

Thr Gly Asn Leu Pro Gly Cys Ser ?he Ser lie Phe Leu Leu Ala Leu 

50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 

85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 

130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 

145 150 155 160 

GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 

165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576 

Ser Pro Arg Arg His Glu Thr Val Gin Aso Cys Asn Cys Ser He Tyr 

180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 

Pro Gly His He Thr Gly His Arg Met Ala Trp Aso Met Met Met Asn 

195 200 205 

TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 672 

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg lie Pro 

210 215 220 

CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720 

Gin Ala Val Val Asp Met Val Ala Giy Ala His Trp Gly Val Leu Ala 

225 230 235 240 

GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT 768 

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 

' 245 250 255 

GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA 316 

Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 

260 265 270 



GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC 
Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 
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275 280 285 



GGG TCG GCT 


CAG 


AAA ATC 


CAG 


CTC 


GTA 


AAC 


ACC 


AAC 


GGC 


AGT 


TGG 


CAC 


912 


Gly Ser Ala 


Gin 


Lys He 


Gin 


Leu 


Val 


Asn 


Thr Asn Gly 


Ser 


Trp 


His 




290 






295 










300 










ATC AAC AGG 


ACT 


GCC CTG 


AAC 


TGC 


AAC 


GAC 


TCC 


CTC 


GAA 


ACA 


GGG 


TTC 


960 


He Asn Arg 


Thr 


Ala Leu 


Asn Cys 


Asn 


Asp 


Ser 


Leu 


Gin 


Thr 


Gly 


Phe 




305 




310 










315 








Ton 

J c. U 




TTT GCC GCA 


CTA 


TTC TAC 


AAA 


CAC 


AAA 


TTC 


AAC 


TCG 


TCT 


GGA 


TGC 


CCA 


1008 


Phe Ala Ala 


Leu 


Phe Tyr 


Lys 


His 


Lys 


Phe 


Asn 


Ser 


Ser 


Gly 


Cys 


Pro 








325 








330 










J J J 






GAG CGC TTG 


GCC 


AGC TGT 


CGC 


TCC 


ATC 


GAC 


AAG 


TTC 


GCT 


CAG 


GGG 


TGG 


1056 


Glu Arg Leu 


Ala 


Ser Cys 


Arg Ser 


He 


Asp 


Lys 


Phe Ala 


Gin 


Gly 


Trp 






340 








345 










350 








GGT CCC CTC 


ACT 


TAC ACT 


GAG 


CCT 


AAC 


AGC 


TCG 


GAC 


CAG 


AGG 


CCC 


TAC 


1104 






TK.tt* Tin Y* 


Glu 


Pro 


Asn 


Ser 


Ser Asp 


Gin 


Arg 


Pro 


Tyr 




355 








360 










365 










TGC TGG CAC 


TAC 


GCG CCT 


CGA 


CCG 


TGT 


GGT 


ATT 


GTA 


CCC 


GCG 


TCT 


CAG 


1152 


iiu nj,s 


Tyr 


Ala Pro 


Arg 


Pro 


Cys 


CI \f 

vjiy 


He 


Val 


Pro 


Ala 


Ser 


Gin 




370 






375 










380 












GTG TGC GGT 


CCA 


GTG TAT 


TGC 


TTC 


ACC 


CCG 


AGC 


CCT 


GTT 


GTG 


GTG 


GGG 


1200 


vax \jxy 


Pro 


Vcii A jr^ 


Cys 


Phe 




r^O 


Ser 


Pro 


Val 


val 


val 


Gly 




385 




390 










395 










400 




ACG ACC GAT 


CGG 


TTT GGT 


GTC 


CCC 


ACG 


TAT 


AAC 


TGG 


GGG 


GCG 


AAC 


GAC 


1248 


Thr Thr Asp 


Arg 


Phe Gly 


Val 


Pro 


Thr 


Tyr 


Asn Trp Gly 


Ala 


Asn 


Asp 








- U J 








a~\ n 

ft JLU 










41b 






TCG GAT GTG 


CTG 


ATT CTC 


AAC 


AAC 


ACG 


CGG 


CCG 


CCG 


CGA 


GGC 


AAC 


TGG 


1296 


Ser Asp Val 


Leu 


He Leu 


Asn 


Asn 


Thr 


Arg 


Pro 


Pro 


Arg 


Gly 


Asn 


Trp 






420 


















a i n 
ft J u 








TTC GGC TGT 


ACA 


TGG ATG 


AAT 


GGC 


ACT 


GGG 


TTC 


ACC 


AAG 


ACG 


TGT 


GGG 


1344 


Phe Gly Cys 


Thr 


Trp Met 


Asn Gly 


Thr 


Gly 


Phe 


Thr 


Lys 


Thr 


Cys 


Gly 




a~k^ 

ft J D 








440 










445 










GGC CCC CCG 


TGC 


AAC ATC 


GGG 


GGG 


GCC 


GGC 


AAC 


AAC 


ACC 


TTG 


ACC 


TGC 


1392 


Gly Pro Pro 


Cys 


Asn He 


Gly Gly 


Ala 


Gly 


Asn 


Asn 


Thr 


Leu 


Thr 


Cys 




T J U 






455 










460 










CCC ACT GAC 


TGT 


TTT CGG 


AAG 


CAC 


CCC 


GAG 


GCC 


ACC 


TAC 


GCC 


AGA 


TGC 


1440 


Pro Thr Asp 


Cys 


Phe Arg 


Lys 


His 


Pro 


Glu 


Ala 


Thr 


Tyr 


Ala 


Arg 


Cys 




465 




470 










475 










480 




GGT TCT GGG 


CCC 


TGG CTG 


ACA 


CCT 


AGG 


TGT 


ATG 


GTT 


CAT 


TAC 


CCA 


TAT 


1488 


Gly Ser Gly 


Pro 


Trp Leu 


Thr 


Pro 


Arg 


Cvs 


Met 


Val 


His 


Tyr 


Pro 


Tyr 








485 








490 










495 




AGG CTC TGG 


CAC 


TAC CCC 


TGC 


ACT 


GTC 


AAC 


TTC 


ACC 


ATC 


TTC 


AAG 


GTT 


1536 


Arg Leu Trp 


His 


Tyr Pro 


Cys 


Thr 


Val 


Asr. 


Phe 


Thr' 


lie 


Phe 


Lys 


Val 






500 








505 










510 






AGG ATG TAC 


GTG 


GGG GGC 


GTG 


GAG 


CAC 


AGG 


TTC 


GAA 


GCC 


GCA 


TGC 


AAT 


1584 


Arg Met Tyr 


Val 


Gly Gly 


Val 


Glu 


His 


Arg 


Phe 


Glu 


Ala 


Ala 


Cys 


Asn 




515 








520 










525 











TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG 
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Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
530 535 540 

CTT AGC CCG CTG CTG CTG TCT AC A ACA GAG TGG CAG ATA CTG CCC TGT 1680 
Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Tro Gin lie Leu Pro Cvs 
545 550 555 560 

TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT 1728 
Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu lie His Leu Kis 
565 570 575 

CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT 1776 
Gin Asn lie Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val 
580 585 590 

GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC L824 
Val Ser Leu Val lie Lys Tro Glu Tyr Val Leu Leu Leu Phe Leu Leu 
595 600 605 

CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA 1872 
Leu Ala Asp Ala Arg lie Cys Ala Cys Leu Trp Met Met Leu Leu He 
610 615 620 

GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG 1920 
Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala 
625 630 635 640 

GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC TTC CTT GTG TTC TTC TGT 1968 
Ala Val Ala Gly Ala Kis Gly Thr Leu Ser Phe Leu Val Phe Phe Cys . 

645 650 655 

GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC 2016 
Ala Ala Trp Tyr lie Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala 
660 665 670 

TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA 20 64 

Phe Tyr Gly val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro 
675 680 685 

CGA GCT TAT GCC TAGTAA 2082 
Arg Ala Tyr Ala 
690 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 692 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
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35 



40 



45 



Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 

50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 

85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Tro Val Ala Leu Thr Pro Thr 

115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 

130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cvs Ser Ala Met Tyr Val 

145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr lie 



Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser lie Tyr 

180 185 190 

Pro Gly His He Thr Gly His Arg Met Ala Tro Asp Met Met Met Asn 

195 200 205 

Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 

210 215 220 

Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 

225 230 235 240 

Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
245 250 255 

Val Met Leu Leu Phe Ala Gly Val Asd Glv His Thr Arg Val Ser Gly 

260 265 270 

Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 

275 280 285 

Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser Trp His 

290 295 300 

He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe 

305 310 315 320 

Phe Ala Ala Leu Phe Tyr Lys His Lvs Phe Asn Ser Ser Gly Cys Pro 
325 " 330 335 

Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys Phe Ala Gin Gly Trp 

340 345 350 

Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr 

355 360 365 

Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala Ser Gin 



165 



170 



175 
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370 375 380 

Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val Val Gly 
385 390 395 400 

Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asd 
405 410 415 

Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp 
420 425 430 

Phe Gly Cys Thr Trp Mec Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly 
435 440 445 

Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys 
450 455 460 

Pro Thr Asp Cys Phe Arg Lys Kis Pro Glu Ala Thr Tyr Ala Arg Cys 
465 470 475 480 

Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr 
485 490 495 

Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He Phe Lys Val 
500 505 510 

Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn 
515 520 525 

Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
530 535 540 

Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys 
545 550 555 560 

Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His Leu His 
565 570 575 

Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val 
580 585 590 

Val Ser Leu Val He Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu 
595 600 605 

Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp Met Met Leu Leu He 
610 615 620 

Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala 
625 630 635 640 

Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys 
645 650 655 

Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala 
660 665 670 

Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro 
675 680 685 

Arg Ala Tyr Ala 
690 
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(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2433 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..2430 

(ix) FEATURE: 

(A) NAME /KEY : mac_oeptide 

(B) LOCATION: 1..2427 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



ATG 


AGC 


ACG 


AAT 


CCT 


AAA 


CCT 


CAA AGA 


AAA 


ACC 


AAA CGT 


AAC 


ACC 


AAC 


48 


Met 


Ser 


Thr 


Asn 


Pro 


Lys 


Pro 


Gin Arg Lys 


Thr 


Lys Arg 


Asn 


Thr 


Asn 




1 








5 








10 








15 






CGC 


CGC 


CCA 


CAG 


GAC 


GTC 


AAG 


TTC CCG 


GGC 


GGT 


GGT CAG 


ATC 


GTT 


GGT 


96 


Arg 


Arg 


Pro 


Gin 


Asp 


Val 


Lys 


Phe Pro 


Gly 


Gly 


Gly Gin 


He 


Val 


Gly 










20 








25 








30 






GGA 


GTT 


TAC 


CTG 


TTG 


CCG 


CGC 


AGG GGC 


CCC 


AGG 


TTG GGT 


GTG 


CGC 


GCG 


144 


Gly Val 


Tyr 


Leu 


Leu 


Pro 


Arg 


Arg Gly 


Pro 


Arg 


Leu Gly 


Val 


Arg Ala 








35 










40 






45 










ACT 


AGG 


AAG 


ACT 


TCC 


GAG 


CGG 


TCG CAA 


CCT 


CGT 


GGG AGG 


CGA 


CAA 


CCT 


192 


Thr 


Arg 


Lys 


Thr 


Ser 


Glu 


Arg 


Ser Gin 


Pro 


Arg 


Gly Arg 


Arg 


Gin 


Pro 






50 










55 








60 








ATC 


CCC 


AAG 


GCT 


CGC 


CGA 


CCC 


GAG GGT 


AGG 


GCC 


TGG GCT 


CAG 


CCC 


GGG 


240 


He 


Pro 


Lys 


Ala 


Arg 


Arg 


Pro 


Glu Gly Arg 


Ala 


Trp Ala 


Gin Pro Gly 




65 










70 








75 








80 




TAC 


CCT 


TGG 


CCC 


CTC 


TAT 


GGC 


AAT GAG 


GGC 


ATG 


GGG TGG 


GCA 


GGA 


TGG 


288 


Tyr 


Pro 


Trp 


Pro 


Leu 


Tyr 


Gly 


Asn Glu Gly 


Met 


Gly Trp 


Ala Gly 


Trp 












85 








90 








95 




CTC 


CTG 


TCA 


CCC 


CGC 


GGC 


TCT 


CGG CCT 


AGT 


TGG 


GGC CCT 


ACA 


GAC 


CCC 


336 


Leu 


Leu 


Ser 


Pro 


Arg 


Gly 


Ser 


Arg Pro 


Ser 


Trp 


Gly Pro 


Thr 


Asp 


Pro 










100 








105 








110 






CGG 


CGT 


AGG 


TCG 


CGT 


Aat 


TTG 


GGT AAG 


GTC 


ATC 


GAT ACC 


CTT 


ACA 


TGC 


384 


Arg Arg Arg 


Ser 


Arg 


Asn 


Leu 


Gly Lys 


Val 


He 


Asp Thr 


Leu 


Thr 


Cys 








115 










120 






125 








GGC 


TTC 


GCC 


GAC 


CTC 


GTG 


GGG 


TAC ATT 


CCG 


CTC 


GTC GGC 


GCC 


CCC 


CTA 


432 


Gly 


Phe 


Ala 


Asp 


Leu 


Val 


Gly 


Tyr He 


Pro 


Leu 


Val Gly 


Ala 


Pro 


Leu 






130 










135 








140 










GGG 


GGC 


GCT 


GCC 


AGG 


GCC 


CTG 


GCG CAT 


GGC 


GTC 


CGG GTT 


CTG 


GAG 


GAC 


480 
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Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

GGC GTG AAC TAT GCA ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC 528 
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

TTC CTC TTG GCT TTG CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT 576 
Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 
180 135 190 

GAA GTG CGC AAC GTG TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC 624 
Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
195 200 205 

AAC TCA AGC ATT GTG TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC 672 
Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 
210 215 220 

GGG TGC GTG CCC TGC GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA 72 0 

Gly Cys Val Pro Cys Val Arg Glu As- Asn Ser Ser Arg Cys Tro Val 
225 230 235 " 240 

GCG CTC ACC CCC ACG CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG 768 
Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
245 " 250 255 

ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT 816 
Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 255 270 

TCC GCT ATG TAC GTG GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC 864 
Ser Ala Met Tyr Val Gly Aso Leu Cys Glv Ser Val Phe Leu Val Ser 
275 280 285 

CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC 912 
Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 
290 295 300 

AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG 960 
Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala Trp 
305 310 315 320 

GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG 1008 
Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin 
325 330 335 

CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT 1056 
Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 
340 345 350 

TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG 1104 
Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Tro 
355 360 365 

GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT 1152 
Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
370 375 380 

ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG 1200 
Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val 
385 390 395 400 
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TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC 1248 
Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie Gin Leu Val Asn Thr 
405 410 415 

AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC 1296 
Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asd Ser 
420 425 430 

CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC 1344 
Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 
435 440 445 

TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG 1392 
Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asd Lys 
450 455 460 

TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG 1440 
Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 
465 470 475 480 

GAC CAG AGG CCC TAC .TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT 1488 
Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly lie 
485 490 495 

GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC 1536 
Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 
500 505 510 

CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC 1584 
Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 
515 520 525 

TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG 1632 
Trp Gly Ala Asn Asp Ser Asp Val Leu lie Leu Asn Asn Thr Arg Pro 
530 535 540 

CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC 1680 
Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe 
545 550 555 560 

ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC 1728 
Thr Lys Thr Cys Gly Gly Pro Pro Cvs Asn He Gly Gly Ala Glv Asn 
565 570 575 

AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC 1776 
Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 
580 585 590 

ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG 1824 
Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 
595 600 605 

GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC 1372 
Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 
610 615 620 

ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC 1920 
Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe 
625 630 635 640 

GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC 1968 
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Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 655 

AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG 2016 
Arg Asp Arg Ser Glu Leu Ser Pro leu Leu Leu Ser Thr Thr Glu Trp 
660 565 670 

CAG ATA CTG CCC TGT TCC TTC ACC ACC CTG CCG GCC CTA TCC ACC GGC 2064 
Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 
675 680 685 

CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC GTG CAA TAC CTG TAC GGT 2112 
Leu He His Leu His Gin Asn lie Val Asp Val Gin Tyr Leu Tyr Gly 
690 695 700 

GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC CTG 2160 
Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val Leu 
705 710 715 " 720 

TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA TGG 2208 
Leu Leu Phe Leu Leu Leu Ala Aso Ala Arg He Cys Ala Cys Leu Trp 
•725 730 " 735 

ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG GTG 22 56 

Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 
740 745 750 

GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC TTC 23 04 

Val Leu Asn Ala Ala Ala Val Ala 31 v Ala His Gly Thr Leu Ser Phe 
755 750 765 

CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC CCT 23 52 

Leu Val Phe Phe Cys Ala Ala Tro Tvr He Lys Gly Arg Leu Val Pro 
770 775 780 

GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT CTG 2400 
Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

CTG GCC TTA CCA CCA CGA GCT TAZ" GCC TAGTAA 2433 
Leu Ala Leu Pro Pro Arg Ala Tyr Ala 

805 810 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: proteir. 

(xi) SEQUENCE DESCRIPTION: SZQ ID NO: 50: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
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35 



40 



45 



Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

He Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Tro 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 

115 120 125 

Gly Phe Ala Aso Leu Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 " 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 
180 185 190 

Glu Val Arg Asn Val Ser Gly Mec Tyr His Val Thr Asn Asp Cvs Ser 
195 200 205 

Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 
210 215 220 

Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cvs Trp Val 
225 230 235 ~ 240 

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
245 250 255 

Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser 
275 280 285 

Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val Gin Asd Cys 
290 295 300 

Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala Tro 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin 
325 330 335 

Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 
340 345 350 

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp" 
355 360 365 



Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
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370 375 380 

Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Aso Thr Arg Gly Leu Val 
385 4 390 395 " 400 

Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Aso Ser 
420 425 430 

Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Aso Lys 
450 455 460 

Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 
465 470 475 480 

Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Glv He 
485 490 495 

Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 
500 505 " 510 

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 
515 520 525 

Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro 
530 535 540 

Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Mec Asn Gly Thr Gly Phe 
545 550 555 560 

Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Glv Gly Ala Gly Asn 
565 570 575 

Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lvs His Pro Glu Ala 
580 * 585 590 

Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 
595 600 605 

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cvs Thr Val Asn Phe 
610 615 520 

Thr lie Phe Lys Val Arg Met Tyr Val' Gly Gly Val Glu His Arg Phe 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 " 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 
' 660 665 670 

Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 
675 680 685 

Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly 
690 695 700 
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Val Gly Ser Ala Val Val Ser Leu Val lie Lys Trp Glu Tyr Val Leu 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp 
725 730 ^ 735 

Met Met Leu Leu lie Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val 
740 745 750 

Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser Phe 
755 760 765 

Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro 
770 775 780 

Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu 
785 790 795 800 

Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
805 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: pepcide 



(ix) FEATURE : 

(A) NAME/ KEY : Modif ied-si te 

(B) LOCATION: 1..X7 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ser Asn Ser Ser Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys 
1 5 10 15 

Val 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRAND SDNS SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-site 

(B) LOCATION: 1 . . 22 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Gly lie Thr Gly His Arg Met Ala Trp Asp Met Mec Met Asn Trp 
15 10 15 

Ser Pro Thr Thr Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) ST HANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY: Modified-site 
(3) LOCATION: 1..37 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Tyr Glu Val Arg Asn Val Ser Gly He Tyr His Val Thr Asn Asp Cys 
15 10 15 

Ser Asn Ser Ser He Val Tyr Glu Ala Ala Aso Mec He Men His Thr 
20 25 ' 30 

Pro Gly Cys Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1. .25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Gly Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
15 10 15 

Gin Leu Arg Arg His He Asp Leu Leu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1..25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gly Gly Thr Pro Thr Leu Ala Ala Arg Asp Ala Ser Val Pro Thr Thr 
1 5 10 15 

Thr He Arg Arg His Val Aso Leu leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Leu Leu Ser Cvs Leu Thr Val Pro Ala Ser Ala Tyr Gin Val Arg Asn 
15 10 15 

Ser Thr Gly Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
• (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro 
1 5 10 - * 15 

Asn Ser Ser He 
20 
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(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Asn Asp Cys Pro Asn Ser Ser He Val Tyr Glu Ala His Asp Ala He 
1.5 10 15 

Leu His Thr Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Ser Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr 
15 10 15 

Pro Gly Cys Vai 
20 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

His Asp Ala He Leu His Thr Pro Gly Val Pro Cys Val Arg Glu Gly 
15 10 15 

Asn Val Ser 



(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Cys Val Arg Glu Gly Asn Val Ser Arg Cys Tro Val Ala Mec Thr Pro 
15 10 15 

Thr Val Ala Thr 

20 • 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 62: 

Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
15 10 15 

Gin Leu Arg Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu Pro Ala Thr Gin Leu Arg Arg His lie Asp Leu Leu Val Gly Ser 
15 10 15 

Ala Thr Leu Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 64: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu 
15 10 15 

Cys Gly Ser Val 
20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gin Gly Cys 
1 5 10 15 

Asn Cys Ser lie 
20 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pepcide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Thr Gin Gly Cys Asn Cys Ser lie Tyr Pro Glv His lie Thr Gly His 
15 10 15 

Arg Met Ala Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

lie Thr Gly His Arg Met Ala Txp Asp Met Met Met Asn Tro Ser Pro 
15 10 15 

Thr Ala Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 68: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Asn Trp Ser Pro Thr Ala Ala Leu Val Met Ala Gin Leu Leu Arg lie 
1 5 10 15 

Pro Gin Ala He 
20 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) ST RAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Leu Leu Arg He Pro Gin Ala He Leu Asp Met He Ala Gly Ala His 
15 10 15 

Trp Gly Val Leu 
20 

(2) INFORMATION FOR SEQ ID NO : 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



WO 02/055548 



69 



PCT/EP02/00219 



(xi) SEQUENCE DESCRIPTION: SEQ ID NOi 70: 

Ala Gly Ala His Tru Gly Val Leu Ala Gly lie Ala Tyr Phe Ser Met 
15 10 15 

Val Gly Asn Met 
20 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr He Val Ser 
15 10 15 

Gly Gly Gin Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(8) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Ser Gly Leu Val Ser Leu Phe Thr Pro Gly Ala Lys Gin Asn lie Gin 
15 10 15 

Leu lie Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gin Asn lie Gin Leu He Asn Thr Asn Gly Gin Trp His lie Asn Ser 
15 10 15 

Thr Ala Leu Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
<B) TYPE: amino acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Trp Leu Ala Gly Leu 
15 10 15 

lie Tyr Gin His Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala Gly Leu He Tyr Gin His Lys Phe Asn Ser Ser Gly Cys Pro Glu 
15 10 15 

Arg Leu Ala Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu Thr Asp Phe Asp 
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15 10 15 

Gin Gly Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Thr Asp Phe Asp Gin Gly Trp Gly Pro lie Ser Tyr Ala Asn Gly Ser 
15 10 15 

Gly Pro Asp Gin 
20 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ala Asn Gly Ser Gly Pro Asp Gin Arg Pro Tyr Cys Trp His Tyr Pro 
15 10 15 

Pro Lys Pro Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Trp His Tyr Pro Pro Lys Pro Cys Gly He Val Pro Ala Lys Ser Val 
1 5 10 15 

Cys Gly Pro Val 
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20 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 80: 

Ala Lys Ser Val Cys Gly 'Pro Val Tyr Cys ?he Thr Pro Ser Pro Val 
15 10 15 

Val Val Gly Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: pepcide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Pro Ser Pro Val Val Val Gly Thr Thr Ast> Arg Ser Gly Ala Pro Thr 
1 5 10 15 

Tyr Ser Trp Gly 
20 . 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Gly Ala Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val 
15 10 15 

Leu Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 83: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys 
15 10 15 

Val Cys Gly Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Val Cys lie Gly Gly Ala 
15 10 15 

Gly Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A.) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

lie Gly Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Arg 
15 10 15 

Lys His Pro 



(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr Ser Arg Cys Gly 
15 10 IS 

Ser Gly Pro Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 87: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Ser Arg Cys Gly Ser Gly Pro Trp He Thr Pro Arg Cys Leu Val Asp 
15 10 15 

Tyr Pro Tyr Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{D} TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr He 
1 5 10 15 

Asn Tyr Thr He 
20 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Pro Cys Thr He Asn Tyr Thr He Phe Lys He Arg Met Tyr Val Gly 
15 10 15 

Gly Val Glu His 
20 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE ' CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala Cys Asn Trp 
15 10 15 

Thr Pro Gly Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala Cys Asn Trp Thr Pro Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 
1-5 10 15 

Arg Ser Glu Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr 
15 10 15 

Gin Trp Gin Val 
20 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Tyr Gin Val Arg Asn Ser Thr Gly Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
ACGTCCGTAC GTTCGAATTA ATTAATCGA 29 
(2) INFORMATION FOR SEQ ID NO: 95: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 
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(XX ) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
CCTCCGGACG TGCACTAGCT CCCGTCTGTG GTAGTGGTGG TAGTGATTAT CAATTAATTG 60 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTTTAACCAC TGCATGATG 19 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) . . 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GTCCCATCGA GTGCGGCTAC 20 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGTGACATGG TACATTCCGG ACACTTGGCG CACTTCATAA GCGGA 45 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
{iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TGCCTCATAC ACAATGGAGC TCTGGGACGA GTCGTTCGTG AC 42 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

' (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

J iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
TACCCAGCAG CGGGAGCTCT GTTGCTCCCG AACGCAGGGC AC 42 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



WO 02/055548 



79 



PCT/EP02/00219 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TGTCGTGGTG GGGACGGAGG CCTGCCTAGC TGCGAGCGTG GG 42 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CGTTATGTGG CCCGGGTAGA TTGAGCACTG GCAGTCCTGC ACCGTCTC 48 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CAGGGCCGTT CTAGGCCTCC ACTGCATCAT CATATCCCAA GC 42 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
CCGGAATGTA CCATGTCACG AACGAC 



26 
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(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GCTCCATTGT GTATGAGGCA GCGG 

(2) INFORMATION FOR SEQ ID NC: 106: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 23 base nairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GAGCTCCCGC TGCTGGGTAG CGC 23 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CCTCCGTCCC CACCACGACA ATACG 
(2) INFORMATION FOR SEQ ID NO: 108: 



107: 



25 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

CTACCCGGGC CACATAACGG GTCACCG 27 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: MO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
GGAGGCCTAC AACGGCCCTG GTGG 24 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ■ 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
TTCTATCGAT TAAATAGAAT TC 22 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
GCCATACGCT CACAGCCGAT CCC 



23 
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(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH z~2 0 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: ,112 : 

Tyr Glu Val Arg Asn Val Ser Gly He Tyr His Val Thr Asn Asp Cys 
! 5 10 15 

Ser Asn Ser Ser 
20 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala Ala Asp 
1 5 10 . 15 

Met He Met His Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val 
- 5 10 . 15 
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Arg Glu Asn Asn Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu 
15 10 15 

Thr Pro Thr Leu Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Val Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro 
15 10 15 

Thr Thr Thr lie Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu Leu Val 
15 10 15 



Gly Ala Ala Ala Phe 
20 



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly 
1 " 5 10 15 

Asp Leu Cys Gly Ser 
20 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu 
15 1° 15 

Phe Thr lie Ser Pro 
20 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val Gin 
15 10 15 

Asp Cys Asn Cys Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr 
15 10 15 

. Gly His Arg Met Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
1 5 10 15 

Ser Pro Thr Thr Ala 
20 
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